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NOVEL ORGANIC ANION TRANSPORT PROTEINS 

This application claims priority from provisional U.S. Application Serial No. 
60/135,081, filed May 20, 1999, which is incorporated herein by reference in its 
entirety. 



Field of the Invention 

The invention claims isolated nucleic acid encoding all or a portion of novel 
10 members of the organic anion transport protein ("OATP") designated OATP2, OATP- 
RP1, OATP-RP2, OATP-RP3, OATP-RP4 and OATP-RP5. Also claimed are vectors 
containing the nucleic acid sequences, host cells containing the vectors and 
polypeptides having all or part of the amino acid sequence of OATP2, OATP-RP 1 , 
OATP-RP2, OATP-RP3, OATP-RP4 and OATP-RP5. Tissue expression of the 
15 transporter is described as well as some of its substrates. Also claimed are uses for 
these novel OATPs, including for targeting drugs to specific tissues, for modulating 
the concentration of endogenous substrates, and for identifying a substrate capable of 
being transported by a novel OATP of the invention. * 

20 Background of the Invention 

The liver functions in the clearance of a large variety of metabolic products, 
drugs and other xenobiotics by transporting them across the sinusoidal membrane into 
the hepaiocyte. Several classes of transport systems have been described that mediate 
these processes including the Na+/taurocholate cotransporter polypeptide, NTCP, in 

25. rat and human liver (Hagenbuch, B., et al. (1991) Proc. Natl. Acad. Sci. USA 

88:10629-33; Hagenbuch, B. et al., (1994) J. Clin. Invest. 93:1326-31) and a family of 
organic anion transporting polypeptides (OATPs) that are principally expressed in 
liver, kidney and brain, and transport a broad spectrum of substrates in a sodium- 
independent manner (Meier, PJ., et al., ( 1 997) Hepatology 26: 1 667-77; Wolkoff, . 

30 A.W., (1996) Semin. Liver Dis. 16:121-127). The distribution of this latter family of 
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transporters in liver, kidney and choroid plexus in the brain is thought to reflect 
common physiological requirements of these organs for the clearance of a multitide of 
organic anions. There are three OATP isoforms in the rat: roatpl (Jacquemin, E., et 
al., (1994) Proc. Natl Acad. Sci. USA 91:133-37); roatp2 (Noe, B.A., et al., (1997) 
Proc. NatL Acad. Sci. USA 94:10346-50; and roatp3 (Abe, T., et al, (1998) J. Biol. 
Chem. 273:1 1395-401). In addition to bile acids, OATPs are known to transport a 
variety of other compounds. These include, depending on the transporter, 
unconjugated and conjugated steroids such as estrone sulfate, estradiol- 17B- 
glucuronide, aldosterone, and cardiac glycosides (Boussuyt, X., et al., (1996)7. 
Pharmacol Exp. Ther. 276:891-6; Boussuyt, X. (1996) J. Hepatol 25:733-8; Kanai, 
R, et al., (1996) Am. J. Physiol 270:F319-F325; Kanai, N M et al., (1996) Am. J. 
Physiol 270:F326-F331; Noe, B.A., et al., (1997) Proc. Natl Acad. Sci USA 
94:10346-50). Bromosulfophthalien (Jacquemin, E., et al., (1994) Proc. Natl Acad. 
Sci. USA 91:133-7); mycotoxin (Kontaxi, M , et al., (1996) J. Pharmacol Exp. Ther. 
279:1507-13); leukotriene C 4 (Li, L., et al., (1998) J. Biol Chem. 273:16184-91); and 
thyroid hormone (Abe, T., et al., ( 1 998) J. Biol Chem. 273:11 395) are additional 
substrates. 

Several proteins have been identified. Jacquemin,E., et al., (1994) Proc. Natl 
Acad. Sci. U.S.A., 91:133-137 reported the first cloning and identification of a 
member of the OATP transporter family, namely the rat oatpl . The first cloning and 
identification of a human OATP was reported in Kullak-Ublick, G.A., et al., (1995) 
Gastroenterology, 109:1274-1282. Its expression was found in liver, kidney brain 
and other organs. The authors concluded, based on substrate specificities, that it was 
not the human orthologue of rat oatpl . 

Substrate specificities of rat oatpl are discussed in Kullak-Ublick, G.A. et al., 
(1994) Hepatology, 20:41 1-416, while substrate specificities of human OATP are 
discussed in Bossuyt, X., et al., (1996) J. Hepatol, 25:733-738. 

Data was later discovered showing that rat oatpl is involved in the transport of 
steroids (Bossuyt, X., et al., (1996) J. Pharmacol Exp. Ther., 276:891-896), and that 
human OATP acts as a transporter for the psychoactive hormone DHEAS (Kullak- 
Ublick, G.A., et al., (1998) FEBS Lett., 424:1 73-176). For a review of the OATP 
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family and organic anoin transport in the liver, see Wolkoff, A.W., (1996) Semin. 
. Liver Dis., 16:121-127. 

A third rat OATP isoform that was shown to transport thyroid hormones T3 
and T4 was cloned and reported in Abe,T., et al., (1998) J. Biol, Chem., 273:22395- 
5 22401. 

All references cited herein, whether supra or infra, are hereby incorporated by 
reference in their entirety. 



Summary of the Invention 

10 The present invention encompasses novel organic anion transport proteins 

("OATP") and polynucleotides encoding said OATPs. The OATPs disclosed herein 
are designated OATP2, OATP-RP2, OATP-RP3, OATP-RP4, OATP-RP5 and 
OATP-RP 1 • A polynucleotide sequence of each OATP is disclosed herein, along 
with the deduced amino acid sequence. The cDNAs encoding the OATPs of the 

15 present invention have been deposited with the American Type Culture Collection 
and given Accession Numbers ATCC 207213 (OATP2), ATCC 207212 (OATP- 
RP2), ATCC 207209 (OATP-RP3), ATCC 207210 (OATP-RP4), ATCC 20721 1 
(OATP-RP5), and ATCC 20721 4 (OATP-RP1 ). 

The present inventors sequenced the cDNAs encoding the novel OATPs and 

20 determined the primary sequence of the deduced proteins. Disclosed herein are the 
nucleic acid sequence (SEQ ID NO:l) and amino acid sequence (SEQ ID NO:2) of 
OATP2; the nucleic acid sequence (SEQ ID NO:3) and amino acid sequence (SEQ ID 
NO:4) of OATP-RP2; the nucleic acid sequence (SEQ ID NO:5) and amino acid 
sequence (SEQ ID NO:6) of OATP-RP3; the nucleic acid sequence (SEQ ID NO:7) 

25 and amino acid sequence (SEQ ID NO:8) of OATP-RP4; the nucleic acid sequence 
(SEQ ID NO:9) and amino acid sequence (SEQ ID NO: 1 0) of OATP-RP5; and the 
nucleic acid sequence (SEQ ID NO:l 1) and amino acid sequence (SEQ ID NO: 12) of 
OATP-RP1. 

The OATPs of the present invention can be produced by: ( 1 ) inserting the 
30 cDNA of a disclosed OATP into an appropriate expression vector; (2) transfecting the 
expression vector into an appropriate transfection host(s); (3) growing the transfected 
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. host(s) in appropriate culture media; and (4) assaying the transport activity in the 
transfected cells. 

The present invention therefore provides a purified and isolated nucleic acid 
molecule, preferably a DNA molecule, having a sequence which codes for an OATP, 

5 or an oligonucleotide fragment of the nucleic acid molecule which is unique to an 
OATP of the invention. In a preferred embodiment of the invention, the purified and 
isolated nucleic acid molecule has the sequence as shown in SEQ ID NO:l (OATP2). 
In another preferred embodiment, the purified and isolated nucleic acid molecule has 
the sequence as shown in SEQ ID NO:3 (OATP-RP2). In still another preferred 

10 embodiment the purified and isolated nucleic acid molecule has the sequence as 
shown in SEQ ID NO:5 (OATP-RP3). In still another preferred embodiment of the 
present invention the purified and isolated nucleic acid molecule has the nucleotide 
sequence as shown in SEQ ID NO:7 (OATP-RP4). In still another preferred 
embodiment the purified and isolated nucleic acid molecule has the sequence as 

15 shown in SEQ ID NO:9 (OATP-RP5). In still another preferred embodiment of the 
present invention the purified and isolated nucleic acid molecule has the nucleotide 
sequence as shown in SEQ ID NO: 1 1 (OATP-RP 1 ). 

The invention also contemplates a double stranded nucleic acid molecule 
comprising a nucleic acid molecule of the invention or an oligonucleotide fragment 

20 thereof hydrogen bonded to a complementary nucleotide base sequence. 

The terms "isolated and purified nucleic acid", "isolated and purified 
polynucleotide", "substantially pure nucleic acid", and "substantially pure 
polynucleotide", e.g., substantially pure DNA, refer to a nucleic acid molecule which 
is one or both of the following: (1) not immediately contiguous with either one or 

25 both of the sequences, e.g., coding sequences, with which it is immediately 

contiguous (i.e., one at the 5 ' end and one at the 3 'end) in the naturally occurring 
genome of the organism from which the nucleic acid is derived; or (2) which is 
substantially free of a nucleic acid sequence with which it occurs in the organism 
from which the nucleic acid is derived. The term includes, for example, a 

30 recombinant DNA which is incorporated into a vector, e.g., into an autonomously 
replicating plasmid or vims, or into the genomic DNA of a prokaryote or eukaryote, 
or which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment 
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produced by PCR or restriction endonuclease treatment) independent of other DNA 
sequences. Substantially pure or isolated and purified DNA also includes a 
recombinant DNA which is part of a hybrid gene encoding additional OATP 
sequence. 

S The present invention provides in one embodiment: (a) an isolated and 

purified nucleic acid molecule comprising a sequence encoding all or a portion of a 
protein having the amino acid sequence as shown in SEQ ID NO:2 (OATP2); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences which 
exhibit at least 80%, more preferably at least 90%, more preferably at least 95%, and 

10 most preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) 
that is at least 1 8 bases and which will hybridize to (a) or (b) under stringent 
conditions. 

The degree of homology (percent sequence identity) between two sequences 
may be determined, for example, by comparing the two sequences using computer 

1 5 programs commonly employed for this purpose. One suitable program is the GAP 
computer program described by Devereux et al., (1984) NucL Acids Res. 12:387. The 
GAP program utilizes the alignment method of Needleman and Wunsch (1 970) J. 
MoL Biol. 48:433, as revised by Smith and Waterman (\9Sl) Adv. AppL Math. 2:482. 
Briefly, the GAP program defines percent identity as the number of aligned symbols 

20 (i.e., nucleotides or amino acids) which are identical, divided by the total number of 
symbols in the shorter of the two sequences. 

As used herein the term "stringent conditions" encompasses conditions known 
in the art under which a nucleotide sequence will hybridize to: (a) an isolated and 
purified nucleic acid molecule comprising a sequence encoding a protein having the 

25 amino acid sequence as shown herein, or to (b) a nucleic acid sequence 

complementary to (a). Screening polynucleotides under stringent conditions may be 
carried out according to the method described in Nature, 3 1 3 :402-404 ( 1 985). 
Polynucleotide sequences capable of hybridizing under stringent conditions with the 
polynucleotides of the present invention may be, for example, allelic variants of the 

30 disclosed DNA sequences, or may be derived from other sources. General techniques 
of nucleic acid hybridization are disclosed by Sambrook et al.. "Molecular Cloning: A 
Laboratory Manual", 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, 
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New York (1984); and by Haymes et ah, "Nucleic Acid Hybridization: A Practical 
Approach", IRL Press, Washington, D.C. (1985), which references are incorporated 
herein by reference. 

The present invention provides in another embodiment: (a) an isolated and 

5 purified nucleic acid molecule comprising a sequence encoding all or a portion of a 
protein having the amino acid sequence as shown in SEQ ID NO:4 (OATP-RP2); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences which are at 
least 80%, more preferably at least 90%, more preferably at least 95%, and most 
preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is 

10 at least 1 8 bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention provides in another embodiment: (a) an isolated and 
purified nucleic acid molecule comprising a sequence encoding all or a portion of a 
protein having the amino acid sequence as shown in SEQ ID NO:6 (OATP-RP3); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences which are at 

15 least 80%, more preferably at least 90%, more preferably at least 95%, and most 

preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is 
at least 1 8 bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention provides in another embodiment: (a) an isolated and 
purified nucleic acid molecule comprising a sequence encoding all or a portion of a 

20 protein having the amino acid sequence as shown in SEQ ID NO:8 (OATP-RP4); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences which are at 
least 80%, more preferably at least 90%, more preferably at least 95%, and most 
preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is 
at least 1 8 bases and which will hybridize to (a) or (b) under stringent conditions. 

25 The present invention provides in another embodiment: (a) an isolated and 

purified nucleic acid molecule comprising a sequence encoding all or a portion of a 
protein having the amino acid sequence as shown in SEQ ID NO: 10 (OATP-RP5); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences which are at 
least 80%, more preferably at least 90%, more preferably at least 95%, and most 

30 preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is 
at least 1 8 bases and which will hybridize to (a) or (b) under stringent conditions. 
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The present invention provides in another embodiment: (a) an isolated and 
purified nucleic acid molecule comprising a sequence encoding all or a portion of a 
protein having the amino acid sequence as shown in SEQ ID NO:12 (OATP-RP1); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences which are at 

5 least 80%, more preferably at least 90%, more preferably at least 95%, and most 
preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is 
at least 1 8 bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention also provides: (a) a purified and isolated nucleic acid 
molecule comprising a sequence as shown in SEQ ID NO:l (OATP2); (b) nucleic 

10 acid sequences complementary to (a); (c) nucleic acid sequences having at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is at least 1 8 
bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention further provides: (a) a purified and isolated nucleic acid 

15 molecule comprising a sequence as shown in SEQ ID NO:3 (OATP-RP2); (b) nucleic 
acid sequences complementary to (a); (c) nucleic acid sequences having at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is at least 1 8 
bases and which will hybridize to (a) or (b) under stringent conditions. 

20 The present invention further provides: (a) a purified and isolated nucleic acid 

molecule comprising a sequence as shown in SEQ ID NO:5 (OATP-RP3); (b) nucleic 
acid sequences complementary to (a); (c) nucleic acid sequences having at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is at least 1 8 

25 bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention further provides: (a) a purified and isolated nucleic acid 
molecule comprising a sequence as shown in SEQ ID NO:7 (OATP-RP4); (b) nucleic 
acid sequences complementary to (a); (c) nucleic acid sequences having at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 

30 least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is at least 1 8 
bases and which will hybridize to (a) or (b) under stringent. conditions. 
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The present invention further provides: (a) a purified and isolated nucleic acid 
molecule comprising a sequence as shown in SEQ ID NO:9 (OATP-RP5); (b) nucleic 
acid sequences complementary to (a); (c) nucleic acid sequences having at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is at least 1 8 
bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention further provides: (a) a purified and isolated nucleic acid 
molecule comprising a sequence as shown in SEQ ID NO:l 1 (OATP-RP1); (b) 
nucleic acid sequences complementary to (a); (c) nucleic acid sequences having at 
least 80%, more preferably at least 90%, more preferably at least 95%, and most 
preferably at least 98% sequence identity to (a); or (d) a fragment of (a) or (b) that is 
at least 1 8 bases and which will hybridize to (a) or (b) under stringent conditions. 

The present invention additionally covers polynucleotides and amino acid 
sequences of the present invention having one or more structural mutations including 
replacement, deletion or insertion mutations. For example, a signal peptide may be 
deleted, or conservative amino acid substitutions may be made to generate a protein 
that is still biologically competent or active. 

The invention further contemplates a recombinant molecule comprising a 
nucleic acid molecule of the present invention or an oligonucleotide fragment thereof 
and an expression control sequence operatively linked to the nucleic acid molecule or 
oligonucleotide fragment. A transformant host cell including a recombinant molecule 
of the invention is also provided. 

In another aspect, the invention features a cell or purified preparation of cells 
which include a novel gene encoding an OATP of the present invention, or which 
otherwise misexpresses a gene encoding an OATP of the present invention. The cell 
preparation can consist of human or non-human cells, e.g., rodent cells, e.g., mouse or 
rat cells, rabbit cells, non-human primate cells, or pig cells. In preferred 
embodiments, the cell or cells include an OATP transgene, e.g., a heterologous form 
of an OATP gene, e.g., a gene derived from humans (in the case of a non-human cell). 
The OATP transgene can be misexpressed, e.g., overexpressed or underexpressed. In 
other preferred embodiments, the cell or cells include a gene which misexpresses an 
endogenous OATP gene, e.g., a gene that expression of which is disrupted, e.g., a 

8 
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knockout. Such cells can serve as a model for studying disorders which are related to 
mutated or misexpressed OATP alleles for use in drug screening. 

Still further the invention provides plasmids which comprise the nucleic acid 
molecules of the invention. Also encompassed within the invention are vectors 

5 comprising the nucleic acid sequences disclosed herein, as well as host cells 
comprising said vectors. 

The present invention also includes a novel OATP of the present invention, or 
an active part thereof. A biologically competent or active form of the protein or part 
thereof is also referred to herein as an "active OATP or part thereof'. 

10 The invention further contemplates antibodies having specificity against an 

epitope of an OATP of the present invention or part of the protein. These antibodies 
may be polyclonal or monoclonal. The antibodies may be labeled with a detectable 
substance and they may be used, for example, to detect a novel OATP of the 
invention in tissue and cells. Additionally, the antibodies of the present invention, or 

15 portions thereof, may be used to make targeted antibodies that destroy OATP 

expressing cells (e.g., antibody-toxin fusion proteins, or radiolabeled antibodies). 

The invention also permits the construction of nucleotide probes which encode 
part or all of a novel OATP protein of the invention or a part of the protein. Thus, the 
invention also relates to a probe comprising a nucleotide sequence coding for a 

20 protein, which displays the properties of a novel OAT^ of the invention or a peptide 
unique to the protein. The probe may be labeled, for example, with a detectable (e.g., 
radioactive) substance and it may be used to select from a mixture of nucleotide 
sequences a nucleotide sequence coding for a protein which displays the properties of 
a novel OATP of the invention. 

25 The present invention also provides a transgenic non-human animal (e.g., a 

rodent, e.g., a mouse or a rat, a rabbit or a pig) or embryo all of whose germ cells and 
somatic cells contain a recombinant molecule of the invention, preferably a 
recombinant molecule comprising a nucleic acid molecule of the present invention 
encoding an OATP of the invention or part thereof. The recombinant molecule may 

30 comprise a nucleic acid sequence encoding an OATP of the present invention with a 
structural mutation, or may comprise a nucleic acid sequence encoding an OATP of 
the invention or part thereof and one or more regulatory elements which differ from 
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the regulatory elements that drive expression of the native protein. In another 
preferred embodiment, the animal has an OATP gene which is misexpressed or not 
expressed, e.g., a knockout. Such transgenic animals can serve as a model for 
studying disorders that are related to mutated or misexpressed OATPs of the present 
5 invention. 

The invention still further provides a method for identifying a substance which 
is capable of binding a novel OATP of the invention, comprising reacting a novel 
OATP of the invention or part of the protein under conditions which permit the 
formation of a complex between the substance and a novel OATP protein or part of 

10 the protein, and assaying for substance-OATP complexes, for free substance, for non- 
complexed OATP, or for activation of an OATP. 

An embodiment of the invention provides a method for identifying substrates 
which are capable of binding to a novel OATP protein of the invention, isoforms 
thereof, or part of the protein, said method comprising reacting a novel OATP protein 

15 of the invention, isoforms thereof, or part of the protein, with at least one substrate 
which potentially is capable of binding to the protein, isoform, or part of the protein, 
under conditions which permit the formation of substrate-transporter protein 
complexes, and assaying for substrate-transporter protein complexes, for free 
substrate, for non-complexed OATP protein, or for activation of an OATP. In a 

20 preferred embodiment of the method, substrates are identified which are capable of 
binding to and being transported by a novel OATP protein of the invention, isoforms 
thereof, or part of the protein. 

The invention also provides methods for screening potentially useful 
pharmacological agonists or antagonists of the OATPs of the present invention. The 

25 method comprises testing potential agents by adding the agent to be tested to a cell 
expressing a novel OATP of the present invention in the presence of a compound 
known to be transported by an OATP of the invention, and measuring the 
augmentation or inhibition of transport of the known compound. 

An OATP of the present invention is also useful to identify compounds that 

30 may be transported into an organ, e.g., the liver. Compounds that are found to be 

actively transported into the liver are useful as carriers for other therapeutics targeting 
the liver. 
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Also included within the scope of the present invention is a composition which 
includes an OATP of the present invention, a fragment thereof (or a nucleic acid 
encoding said OATP or fragment thereof) and one or more additional components, 
e.g., a carrier, diluent or solvent. The additional component can be one that renders 
5 the composition useful for in vitro, in vivo, pharmaceutical or veterinary use. 

Encompassed within the present invention are agonists and antagonists of an 
OATP of the present invention. Pharmacological agonists or antagonists are useful to 
increase or decrease the flow of compounds transported by an OATP of the present 
invention. Said agonists and/or antagonists of the present invention are preferably 
10 administered with an acceptable carrier, diluent or solvent. 

In another aspect, the present invention relates to a method of treating a 
mammal, e.g., a human, at risk for a disorder, e.g., a disorder characterized by 
aberrant or unwanted level or biological activity of an OATP of the present invention. 
Additionally, encompassed within the invention is a method of treating a mammal, 
15 e.g., a human, at risk for disorders of the liver. Since OATP2 is expressed exclusively 
in the liver, compounds that are optimized for OATP2 are useful to target hepatic 
delivery. These compounds in themselves may be useful therapeutics, or may be 
useful to chaperone other therapeutic compounds to the liver. In addition, blocking 
OATP2-compound interactions could provide benefit by decreasing its first-pass 
20 extraction by the liver and, thus, increasing plasma concentrations and prolonging the 
systemic half life of a drug. 

Also within the scope of the present invention are fusion proteins comprising 
all or a portion of an OATP of the present invention. 

25 The primary object of the present invention is the identification of new human 

OATPs, as identified by the nucleic acid and amino acid sequences disclosed herein. 
Additional objects of the invention are the methods of using the cDNA, the OATP 
proteins, monoclonal antibodies specific for the novel OATPs, fusion proteins 
comprising a portion of the OATP protein of the present invention, and agonists 

30 and/or antagonists of the novel OATPs as described above. 
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Brief Description of the Figures 

Figure 1 is a Northern blot showing the mRNA tissue distribution of OATP2, 

OATP-RP1, OATP-RP2, OATP-RP4, and OATP-RP5. The tissues corresponding to 

the abbreviations above the lanes are indicated below. 
5 Figure 2 shows that OATP2 transports pravastatin, dehydroepiandosterone 

sulfate (DHEAS), taurocholate and thyroid hormone (T ). Figure 2A shows specific 

uptake of [ 3 H]-pravastatin and f 3 H]-DHEAS. Figure 2B shows specific uptake of 

[ 3 H]-taurocholate. Panel 2C shows specific uptake of [125I]-thyroid hormone (T4). 

The uptake of radiolabeled substrate for 5 minutes into cells transfected with 
10 pCEPOATP-RPl or empty vector (MOCK) was determined in the absence (solid 

bars) and presence (open bars) of excess unlabeled substrate. 

Figure 3 shows a sequence alignment of OATP family members. The protein 

sequences of human OATP2, OATP-RP1, OATP-RP2, OATP-RP3, OATP-RP4, and 

OATP-RP5 are aligned with the other known OATP family members. Also shown is 
15 a concensus sequence in bold. A concensus is indicated if at least 6 out of the 1 2 

sequences are identical at a given position. A residue is capitalized if it agrees with 

the concensus. 

Detailed Description of the Invention 
20 The following definitions apply to the terms used throughout this 

specification, unless otherwise defined in specific instances: 

"cloning" - isolation of a particular gene from genetic material, for example a 
genome, genomic library, or cDNA library into a plasmid or other vector; 

"coding region" - the region of a nucleic acid sequence that codes for an 
25 active protein; 

"OATP" - organic anion transport protein; 

"stringent conditions" (as used concerning nucleic acid hybridization) — 
Southern blotting washed in 0.1 X SSC and 0.1 % SDS at a temperature of at least 
about 65° C. See Maniatis et al.. Molecular Cloning: A Laboratory Manual, Cold 
30 Spring Harbor Laboratory, Cold Spring Harbor, NY (1982); one skilled in the 
relevant art would recognize that less stringent conditions (e.g., IX or 2X SSC, 
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0.1%SDS) may be employed in using the novel sequences disclosed herein to identify 
nucleic acid sequences encoding novel OATPs. 

"Northern blotting" — a method of identifying particular RNA fragments by 
hybridization with a complementary nucleic acid, typically a cDN A or an 
5 oligonucleotide; 

"open reading frame" or "ORF" — a DNA sequence containing a series of 
nucleotide triplets coding for amino acids and lacking any termination codes; 

"plasmid" — cytoplasmic, autonomously replicating DNA elements found in 
microorganisms; 

10 "promoter" — a region on DNA at which RNA polymerase binds and initiates 

transcription; and 

"Southern blotting" — a method of identifying particular DNA fragments by 
hybridization with a complementary nucleic acid, typically a cDNA or an 
oligonucleotide; 

15 "transport" - the movement of a substance across a biological membrane as 

determined by measuring the redistribution of such a substance across the membrane 

upon exposure to a transporter. 

For definitions of other terms in this specification, see F. Sherman et al., 

Laboratory Course Manual for Methods in Yeast Genetics , Cold Spring Harbor 
20 Laboratory, Cold Spring Harbor, NY (1 987) and Lewin, B., Genes IV, Oxford 

University Press, Oxford (1990). For the definitions of abbreviations, see 

Aldrichimica Acta , Vol. 1 7, No. 1 ( 1 984). 

Use and utility 

25 The amino acid sequences of the novel organic anion transport proteins of the 

present invention are aligned with known transporters of this family in Figure 3. The 
degree of sequence homology between the sequences of the present invention and 
known organic anion transporters indicates that the proteins of the present invention 
are organic anion transporters. 

30 It is believed by those skilled in the art that OATP proteins may be involved in 

the transport of compounds into the liver. Persons of ordinary skill in the art can use 
the OATP proteins of the present invention to assay for agents that may increase or 
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decrease ihe rate of transport of compounds into the liver, or for compounds that are 
transported by the OATPs of the present invention that are useful as carriers for other 
compounds that are desired to be carried to a specific organ (e.g., the liver). 

Therefore, agents that increase or decrease the rate of substrate transport by 

5 the OATPs of the present invention, or agents identified as earners, are useful in the 
treatment of liver disease. 

Because some of the OATPs of the present invention are organ 
specific/selective (e.g., OATP2 - liver; OATP-RP4 - heart and skeletal muscle, and 
OATP-RP5 - brain and testis), compound specificity is built into any specific 

10 substrate of these OATPs and into molecular carriers transported by these OATPs. 
An agent transported by the above OATPs of the present invention would thus be 
delivered to the tissues in which they are expressed and not to tissues lacking the 
above OATPs, thereby achieving tissue specific targeting. 

The OATP nucleic acids of the present invention, or antisense nucleic acids, 

15 may be useful therapeutic or diagnostic agents. For such gene therapy, the nucleic 
acids may be incorporated into vectors and/or formulated as described below and in 
further detail in the art. 

The present invention also provides a basis for diagnostic genetic screens for 
predicting response to drugs. At least one of the transporters disclosed and claimed 

20 herein is a transporter of a known drug (i.e., OATP2 transports pravastatin into 

hepatocytes). Other transporters disclosed herein may similarly transport additional 
drugs into tissues. Persons skilled in the art can: (1) screen the transporter genes for 
allelic variants (genotypes) in the general population by various sequencing methods; 
and (2) determine the association of these transporter genotypes in patients with 

25 response to the transported drug in clinical trials. Particular allelic variants may be 
more or less effective in transporting a drug, which would be related to drug efficacy. 
Thus, genotyping of the claimed transporters could form the basis of a clinical 
diagnostic test to predict a patient's response to drug therapy. 

Persons skilled in the art can use the polypeptides and nucleic acids of this 

30 invention to prepare vectors, cells or cell lines, and antibodies. All of these are useful 
in assays for identification of OATP positive and negative modulators (i.e., agonists 
and/or antagonists) and OATP carriers. The term "positive modulator" as used herein 
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refers to an agent or compound that increases the rate or amount of transport of a 
compound into an organ, e.g., the liver, or an agent or compound that decreases the 
rate or amount of transport of a compound into an organ. The term "negative 
modulator" refers to a compound that is joined to a second compound to prevent the 
5 second compounds transport into or out of cells. The term "carrier" as used herein 
refers to an agent or compound that is transported by an OATP of the present 
invention and that is capable of being joined to or associated with another compound 
to chaperone that other compound into an organ, e.g., the liver. A carrier includes an 
agent that is used to transport a compound into an organ that is otherwise not 

10 transported into said organ, and includes an agent that increases the transport of a 
compound into an organ that is capable of being transported by an OATP. 

One can administer OATP modulators and carriers to various mammalian 
species, such as monkeys, dogs, cats, mice, rats, humans, etc. By known methods, 
persons skilled in the pharmaceutical art can incorporate OATP modulators and 

15 carriers in a conventional systemic dosage form, such as a tablet, capsule, elixir or 
injectable formulation. The above dosage forms will also include any necessary 
physiologically acceptable carrier material, excipient, lubricant, buffer, antibacterial, 
bulking agent (such as mannitol), anti-oxidants (ascorbic acid or sodium bisulfite) or 
the like. 

20 Process of preparation 

In general 

This specification describes the cloning and functional expression of full- 
length human cDNA clones of OATPs, preferably the nucleic acid sequence of 
OATP2 (SEQ ID NO: 1 ), the amino acid sequence of OATP2 (SEQ ID NO:2), the 

25 nucleic acid sequence of OATP-RP2 (SEQ ID NO:3), the amino acid sequence of 
OATP-RP2 (SEQ ID NO:4), the nucleic acid sequence of OATP-RP3 (SEQ ID 
NO:5), the amino acid sequence of OATP-RP3 (SEQ ID NO:6), the nucleic acid 
sequence of OATP-RP4 (SEQ ID NO:7), the amino acid sequence of OATP-RP4 
(SEQ ID NO: 8), the nucleic acid sequence of OATP-RP5 (SEQ ID NO:9), the amino 

30 acid sequence of OATP-RP5 (SEQ ID NO: 1 0), the nucleic acid sequence of OATP- 
RP1 (SEQ ID NO:l 1), and the amino acid sequence of OATP-RP1 (SEQ ID NO: 12). 
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DNA clones comprising nucleotide sequences encoding the OATPs described 
above were deposited with the .American Type Culture Collection ("ATCC") (10801 
University Blvd.. Manassas, VA 201 10-2209) on April 20, 1999. and given the 
following ATCC Accession Numbers: 207209 (OATP-RP3), 207210 (OATP-RP4), 
5 20721 1 (OATP-RP5). 207212 (OATP-RP2), 207213 (OATP2). and 207214 (OATP- 
RP1). The deposit( s) referred to herein will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the Deposit of Microorganisms 
for purposes of Patent Procedure. These deposits are provided merely as convenience 
to those of skill in the an and are not an admission that a deposit is required under 35 

10 U.S.C. §112. The sequence of the polynucleotides contained in the deposited 
materials, as well as the amino acid sequence of the of the polypeptides encoded 
thereby, are incorporated herein by reference and are controlling in the event of any 
conflict with any description of sequences herein. A license may be required to make, 
use or sell the deposited materials, and no such license is hereby granted. 

15 Nucleic acids 

With the disclosed OATP gene sequences in hand, one skilled in the art can 
obtain OATP nucleic acids of this invention by known methods. Such methods 
include: (1) Southern and Northern blotting; (2) Western immunoblotting; (3) 
chemical synthesis; (4) synthesis by polymerase chain reaction (PCR) from primers; 

20 (5) expression cloning; and (6) subtractive cDNA cloning. 

Preferred nucleic acid sequences of the present invention include the following 
(preferably the coding sequences as shown below): 

OATP2 (SEQ ID NOS: 1 and 2): 

25 CGGACGCGTG GGCGGACGCG TGGGTCGCCC ACGCGTCCGA CTTGTTGCAG 
TTGCTGTAGG ATTCTAAATC CAGGTGATTG TTTCAAACTG AGCATCAACA 
ACAAAAACAT TTGTATGATA TCTATATTTC AATC ATG GAC CAA AAT CAA 

M D 0 K Q 

30 CAT TTG AAT AAA ACA GCA GAG GCA CAA CCT TCA GAG AAT AAG • 
HLNKTAEAQPSE IJ X 

AAA ACA AGA TAC TGC AAT GGA TTG AAG ATG TTC TTG GCA GCT 
KTRYC NGLKMFLAA 

35 

CTG TCA CTC AGC TTT ATT GCT AAG ACA CTA GGT GCA ATT ATT 
LSLSrIAKTLGAII 

16 
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ATG AAA AGT TCC ATC ATT CAT ATA GAA CGG AGA TTT GAG ATA 317 

M K SSIIKIERRFEI 

TCC TCT TCT CTT GTT GGT TTT ATT GAC GG A AGC TTT GAA ATT 359 

5 " S S S 1 V G r I D G S F . E I 

GGA AAT TTG CTT GTG ATT GTA TTT GTG AGT TAC TTT GGA TCC 4 01 

G N L L V I V F V S Y F G S 

10 AAA CTA CAT AGA CCA AAG TTA ATT GGA ATC GGT TGT TTC ATT 443 
K Li HRPK LIGIGCFI 

ATG GGA ATT GGA GGT GTT TTG ACT GCT TTG CCA CAT TTC TTC 4 85 

MG I GGVL TAL PHF F" 

15 

ATG GGA TAT TAC AGG TAT TCT AAA GAA ACT AAT ATC GAT TCA 527 

M G Y Y ■ R Y S K E T N I'D S 

TCA GAA AAT TCA ACA TCG ACC TTA TCC ACT TGT TTA ATT AAT 569 

.20 SENST.STLS.T CLIN 

CAA ATT TTA TCA CTC AAT AGA GCA TCA CCT GAG ATA GTG GGA 611 
Q I L S LN RAS P E I VG 

25 AAA GGT TGT TTA AAG GAA TCT GGG TCA TAC ATG TGG ATA TAT 653 
K G C L K E S G S Y M W I . Y 

GTG TTC ATG GGT AAT ATG CTT CGT GGA ATA GGG GAG ACT CCC 695 
VFN'GNML. RGZ'GETP 

30 

ATA GTA CCA TTG GGG CTT TCT TAC ATT GAT GAT TTC GCT AAA 737 
I V P- LGLSYID DFAK 

GAA GGA CAT TCT TCT TTG TAT TTA GGT ATA TTG AAT GCA ATA 779 

35 E G H S S LY h G . I L N A I 

GCA ATG ATT GGT CCA ATC ATTGGC TTT ACC CTG GGA TCT CTG 821 
A M I 3? I IGFTLGS L 

40 TTT TCT AAA ATG TAC GTG GAT ATT GGA TAT GTA GAT CTA AGC 863 

F S K M Y V D I G Y V D L S 

ACT ATC AGG ATA ACT CCT ACT GAT TCT CGA TGG GTT GGA GCT 905 
TIRITPTDSRWVGA 

45 

TGG TGG CTT AAT TTC CTT GTG TCT GGA CTA TTC TCC ATT ATT 94 7 

WWLNFLVSGLFSII 

TCT TCC ATA CCA TTC TTT TTC TTG CCC CAA ACT CCA AAT AAA 989 
50 S S I ? F F F L P Q T P N K 

CCA CAA AAA GAA AGA AAA GCT TCA CTG TCT TTG CAT GTG CTG ' 1031 

• - p Q K Z R K A S L S L H V L 

55 GAA ACA AAT GAT GAA AAG GAT CAA ACA GCT AAT TTG ACC AAT . 1073 

E ' T N Z E Y. D Q T A N h T N 
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CAA GGA AAA AAT ATT ACC AAA AAT GTG ACT GGT TTT TTC CAG 
QGKNI7KNVTGFFQ 

5 TCT TTT AAA AGC ATC CTT ACT AAT CCC CTG TAT GTT ATG TTT 
SFKSILTN PLVVM F 

GTG CTT TTG ACG TTG TTA CAA GTA AGC AGC TAT ATT GGT GCT 

VLLTL. LQVSSYIGA 

10 

TTT ACT TAT GTC TTC AAA TAC GTA GAG CAA CAG TAT GGT CAG 

FTYVFKYVEQQYG Q 

CCT TCA TCT AAG GCT AAC ATC TTA TTG GGA GTC ATA ACC ATA 
15 PSSKANI LLGVITI 

CCT ATT TTT GCA AGT GGA ATG TTT TTA GGA GGA TAT ATC ATT 
PI FASGMFLGGY I I 

20 AAA AAA TTC AAA CTG AAC ACC GTT GGA ATT GCC AAA TTC TCA 

KKFKLKTVGIAXFS 

TGT TTT ACT GCT GTG ATG TCA TTG TCC TTT TAC CTA TTA TAT 
C FTAVM SLS FY L LY 

25 

TTT TTC ATA CTC TGT GAA AAC AAA TCA GTT GCC GGA CTA ACC 
FFILCENKSVAGL T 

ATG ACC TAT GAT GGA AAT AAT CCA GTG ACA TCT CAT AGA GAT 
30 MTYDGNN P V T S H R D 

GTA CCA CTT TCT TAT TGC AAC TCA GAC TGC AAT TGT GAT GAA 
V P L S YCNSDCNCDE 

35 AGT CAA TGG GAA CCA GTC TGT GGA AAC AAT GGA ATA ACT TAC 

S Q W E P V C G N N - G I TV 

ATC TCA CCC TGT CTA GCA GGT TGC AAA TCT TCA AGT GGC AAT 
ISPCLAGC KSSSGN 

40 

AAA AAG CCT ATA GTG TTT TAC AAC TGC AGT TGT TTG GAA GTA 

KKP IVFYN C.SCL EV 

ACT GGT CTC CAG AAC AGA .AAT TAC TCA GCC CAT TTG GGT GAA 
45 TGLQNP. NY'SAHLGE 

TGC CCA AGA GAT GAT GCT TGT ACA AGG AAA TTT TAC TTT TTT 
CPRDDACTRKFY F r 

50 GTT GCA ATA CAA GTC TTG AAT TTA TTT TTC TCT GCA CTT GGA 
VAIQVLNLFFSALG 

GGC ACC TCA CAT GTC ATG CTG ATT GTT AAA ATT GTT CAA CCT 
GTSHVML I.V KIVQ? 

55 

GAA TTG AAA TCA CTT GCA CTG GGT TTC CAC TCA ATG GTT ATA 
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1X15 
1157 
1199 
1241 
1283 
1325 
1367 
1409 
1451 
1493 
1535 
1577 
1619 
1661 
1703 
1745 
1787 
1829 
1871 
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, £ L K 5 1. A L- G F • H 3 M V I 

CGA GCA CTA GGA GGA ATT C7A GCT CCA ATA TAT TTT GGG GCT 

R A LG'G I LA ? Z Y FG A 

5 

CTG ATT GAT ACA ACG TGT ATA AAG TGG TCC ACC AAC AAC TGT 
LIDTTCZKHS.TNNC 

GGC ACA CGT GGG TCA TGT AGG ACA TAT AAT TCC ACA TCA TTT 
10 GTRG 5C RTYNSTSF 

TCA AGG GTC TAC TTG GGC TTG TCT TCA ATG TTA AGA GTC TCA 
SRVYLGLSSMLRVS 

15 TCA CTT GTT TTA TAT ATT ATA TTA ATT TAT GCC ATG AAG AAA 
SLVLYIILIYAMKK 

AAA TAT CAA GAG AAA GAT ATC AAT GCA TCA GAA AAT GGA AGT 

KYOEKDIHASENGS 

20 

• gtc atg gat gaa gca aac tta gaa tcc tta aat .aaa aat aaa 
vmdeanlesl'nkn k 

cat ttt gtc cct tct gct ggg gca gat agt gaa aca cat tgt 
25 hfvp5agadsethc 

taa ggggagaaaa aaagccactt ctgcttctgt gtttccaaac agcattgcat 2260 



30 TGATTCAGTA AGATGTTATT TTTG AGG AG T TCCTGGTCCT TTCACTAAGA , 2 310 

ATTTCCACAT CTTTTATGGT GGAAGTATAA AT AAG CCT AT GAACTTATAA 2360 

TAAAACAAAC TGTAGGTAGA AAAAATG AG A GTACTCATTG TTACATTATA 2410 

G CTACAT ATT TGTGGTTAAG GTTAGACTAT ATG ATC CAT A CAAATTAAAG 24 60 

TGAGAGACAT GGTTACTGTG TAAT AAAAG A AAAAATACTT GTTCAGGTAA 2510 

35 TTCTAATTCT TAATAAAACA AATGAGTATC ATACAGGTAG AGGTTAAAAA 2560 

GGAGGAGCTA GATTCATATC CTAAGTAAAG AGAAATGCCT AG TGTCTATT 2 610 

TTATTAAACA AACAAACACA GAGTTTGAAC TATAATACTA AG G C CTG AAG 2 660 

TCTAGCTTGG ATATATGCTA CAATAATATC TGTTACTCAC ATAAAATTAT 2710 

ATATTTCACA GACTTTATCA ATGTATAATT AACAATTATC TTG TTT AAG T 2760 

40 AAATTTAGAA TACATTTAAG TATTGTGGAA GAAATAAAGA CATTCCAATA 2 810 

TTTGCAAAAA AAAAAAAAAA 2830 
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1913 
1955 
1997 
2039 
2081. 
2123 
2165 
2207 



OATP-RP2 (SEQ ID NOS:3 and 4): 

45 CCCGGGTCGA CCCACGCGTC CGGGATAAAG TACTCCCAGG AAGGCTTTGA 50 
GCCTTGGCAG AAGAGGCTGG GATTGAAGCT TCAGGGAGAG CCAGAGGTGA 100 
GGCTGGAGTG GG AG ATC ACC TGAGGCAGGG CCAGCGGGTG AGGTACCCCA 150 
GGTACCAGAC AAGGAAACCA AAGCCACA ATG GGC ACA GAA AAC ACA CCT 199 

MGTENT P 

50 

GGA GGC AAA GCC AGC CCA GAC CCT CAG GAC GTG CGG CCA AGT 241 

GGKA5 P*DPQD7RPS 

GTG TTC CAT AAC ATC AAG CTG' TTC GTT CTG TGC CAC AGC CTG 2 83 
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CTG CAG CTG GCG CAG CTC ATG ATC TCC GGC TAC CTA AAG AGC 
LQLAQLMXSGYLKS 



325 



TCC ATC TCC ACA GTG GAG AAG CGC TTC GGC CTC TCC AGC CAG 
SI STVEKRFGLSSQ 



367 



!0 



ACG TCG GGG CTG CTG GCC TCC TTC AAC GAG GTG GGG AAC ACA 
TSGL LASFNEVGWT 



409 



GCC TTG ATT GTG TTT GTG AGC TAT TTT GGC AGC CGG GTG CAC 

A L I V F V S Y F G S R V H 



451 



15 CGA CCC CGA ATG ATT GGC TAT GGG GCT ATC CTT GTG GCC CTG 

RP RMIGYGAILVAL 



493 



20 



GCG GGC CTG CTC ATG ACT CTC CCG CAC TTC ATC TCG GAG CCA 535 
AGLLMTLPHFISEP 

TAC CGC TAC GAC AAC ACC AGC CCT GAG GAT ATG CCA CAG GAC 577 

Y R Y D M T S ? E D M P Q D 



25 



TTC AAG GCT TCC CTG TGC CTG CCC ACA ACC TCG GCC CCA GCC 
FKASLCL PTTSAPA 



619 



TCG GCC CCC TCC AAT GGC AAC TGC TCA AGC TAC ACA GAA ACC 
SAPSNGNCSSYTET 



661 



30 CAG CAT CTG AGT GTG GTG GGG ATC ATG TTC GTG GCA CAG ACC 
QHLSVVGIMFVAQT 



35 



CTG CTG GGC GTG GGC GGG GTG CCC ATT CAG CCC TTT GGC ATC 
LLGVGGVPIQPFGI 

TCC TAC ATC GTT GAC TTT GCC CAC AAC AGT AAC TCG CCC CTC 

SYIVDFAHNSNSPL 



703 



745 



787 



40 



TAC CTC GGG ATC CTG TTT GCA GTG ACC ATG ATG GGG CCA GGC 

YLG I LFAV'TMMGPG 



829 



CTG GCC TTT GGG CTG GGC AGC CTC ATG CTG CGC CTT TAT GTG 
LAFGLGSLMLRLYV 



871 



45 GAC ATT AAC CAG ATG CCA GAA GGT GGT ATC AGC CTG ACC ATA 
DINQMPEGGISLT I 



913 



50 



AAG GAC CCC CGA TGG GTG GGT GCC TGG TGG CTG. GGT TTC CTC 955 

K D P r. W V G A W W L G ? L 

ATC GCT GCC GGT GCA GTG GCC CTG GCT GCC ATC CCC TAC TTC 997 

I A A G A V A L A A I P Y F 



55 



TTC TTC CCC AAG GAA ATG CCC AAG GAA AAA CGT GAG CTT CAG 

F T P X E M ? ' K E K R E L Q 



1039 
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TTT CGG CGA AAG GTC TTA GCA GTC ACA GAC TCA CCT GCC AGG 1081 

F R RKVLAV'TDS PAR 

AAG GGC AAG GAC TCT CCC TCT AAG CAG AGC CCT GGG GAG TCC 1123 
5 KGKDSPSK QSP GZS 

ACG AAG AAG CAG GAT GGC CTA GTC CAG ATT GCA CCA AAC CTG 1165 

T K K Q D G L -V 0 I A P U L 

10 ACT GTG ATC CAG TTC ATT AAA GTC TTC CCC AGG GTG CTG CTG 1207 
TVIQFI KVFPRVLL 



15 



30 



45 



CAG ACC CTA CGC CAC CCC ATC TTC CTG CTG GTG GTC CTG TCC 1249 

QTLREPI.FLL V V L S . 

CAG GTA TGC TTG TCA TCC ATG GCT GCG GGC ATG GCC ACC TTC 1291' 

Q V C L S S M A A G M A.T F 



CTG CCC AAG TTC CTG GAG CGC CAG TTT TCC ATC ACA GCC TCC 1333 
20 LPKFLERQFSITAS 

TAC GCC AAC CTG CTC ATC GGC TGC CTC TCC TTC CCT TCG GTC 1375 
Y ANLLIGCLSFPSV 

25 ATC GTG GGC ATC GTG GTG GGT GGC GTC CTG GTC AAG CGG CTC . 1417 
I V G I V V G G V L V K R L 

CAC CTG GGC CCT GTG GGA TGC GGT GCC CTT TGC CTG CTG GGG 14S9 
HLGPVGCGAL CLLG 



ATG CTG CTG TGC CTC TTC TTC AGC CTG CCG CTC TTC TTT ATC 1501 

M L L C L. F F S . L P L F F I 



GGC TGC TCC AGC CAC CAG ATT GCG GGC ATC ACA CAC CAG ACC 1543 
35 GCSSHQIAGITHQT 

AGT GCC CAC CCT GGG CTG GAG CTG TCT CCA AGC TGC ATG GAG 1585 
SAHPG LE LSPSCM E 

40 GCC TGC TCC TGC CCA TTG GAC GGC TTT AAC CCT GTC TGC GAC 1627 
A CS CPLDGFNPVCD 

CCC AGC ACT CGT GTG GAA TAC ATC ACA CCC TGC CAC GCA GGC 1669 
PSTRVEY I TPCHAG 



TGC TCA AGC TGG GTG GTC CAG GAT GCT CTG GAC AAC AGC CAG 1711 
CSSW VVQDALDNSQ 



GTT TTC TAC ACC AAC TGC AGC TGC GTG GTG GAG GGC AAC CCC 1753 
50 VFYTNCSCVVEGNP 

GTG CTG GCA GGA TCC TGC GAC TCA ACG TGC AGC CAT CTG GTG 1795 

. V L AGS CDS T. CSHLV 

55 GTG CCC TTC CTG CTC CTG GTC AGC CTG GGC TCG GCC CTG GCC 1837 

V P F ^ L L V S h . G S A L t A 
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TGT CTC ACC CAC ACA CCC TCC TTC ATG CTC ATC CTA AGA GGA 1879 
CLTH7PSFMLI LRG 

GTG AAG AAA GAA GAC AAG ACT TTG GCT GTG GGC ATC CAG TTC 1921 

V K K E D K T L A V G I 0 F 

ATG TTC CTG AGG ATT TTG GCC TGG ATG CCC AGC CCC GTG ATC 1963 

MFLR'XLAWMPSPVX 

CAC GGC AGC GCC ATC GAC ACC ACC TGT GTG CAC TGG GCC CTG 2005 
HGSAIDTTCVHWAL 



AGC TGT GGG CGT CGA GCT GTC TGT CGC TAC TAC AAT AAT GAC 204 7 
15 SCGRRAV CRYY.NND 

CTG CTC CGA AAC CGG TTC ATC GGC CTC CAG TTC TTC TTC AAA 2 089 
L L R N R F I G L Q F F F K 

20 ACA GGT TCT GTG .ATC TGC TTC GCC TTA GTT TTG GCT GTC CTG 2131 
TGSV I CFALVLAVL 

AGG CAG CAG GAC AAA GAG GCA AGG ACC AAA GAG AGC AGA TCC 2173 
R Q Q D K E . A R T K E S R S 



25 



AGC CCT GCC GTA GAG CAG CAA TTG CTA GTG TCG GGG CCA GGG 2215 
S PAVE QQLLVSGP G 



AAG AAG CCA GAG GAT TCC CGA GTG TGA GCTGTCTTGG GGCCCCACCT 2 262 
30 KKPEDSRV* 

GGCCAAGAGT AG CAG C C ACA G CAG TAC CTC CTCTGAGTCC TTTGCCCAAG 2 312 

ATTGGGTGTC AAGAGCCCTG TGTTCCATTC TGGCTCCTCC ACTAAATTGC 2362 

TGTGTGACTT CAGGCAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2412 

35 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2442 



OATP-RP3 (SEQ ID NOS:5 and 6): 

CC CACGCGTCCG 12 

40 GCGAGGAGCT GTGCCTTCCA CCTCTCCAGC CCCGGCAGGA CGG GGG CGG C 62 
CGCCGCGAAC CCGGGGCGGG GACAGCACGC AGCCTCGAGG CGCGCACCCC 112 
CGCCCGGCAG CGGCCCCGAC ACCCGGGGCG AGCGGGAAAG CGGCAGCGGC 162 
GGCGGCGGCG GCGGCGGCGG GGGAAGG ATG CAG GGG AAG AAG CCG GGC 210 

M Q. G K K P G 

45 

GGT TCG TCG GGC GGC GGC CGG AGC GGC GAG CTG CAG GGG GAC 252 
GSSGGGR.5GELQGD 



GAG GCG CAG AGG AAC AAG AAA AAG AAA AAG AAG GTG TCC TGC 2 94 
50 EAQRNKKKKK KVSC 

TTT TCC AAC ATC AAG ATC TTC CTG GTG TCC GAG TGC GCC CTG 336 
FSNI KI FLVSECAL 
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ATG CTG GCG CAG GGC ACG GTG GGC GCC TAC CTG GTG AGC GTC 
MLAQGTVGA V L V 5 V 



378 



CTG ACC ACC CTG GAG CGT AGG TTC AAC CTG CAG AGC GCT GAC 

L T T L Z . ?. = F N " L 0 S A D 



420 



GTG GGT GTG ATC GCT AGC AGC TTC GAG ATC GGG AAC CTG GCG 4 62 



V 



I 



N 



L A 



10 CTC ATC CTC TTC GTG AGC. TAC TTC GGG GCA CGC GGG CAC CGG 
L I L F V S V F G A R G H R 



504 



15 



CCG CGC CTG ATC GGC TGC GGC GGC ATC GTC ATG GCG CTG GGC 54 6 

PR L IGCG GIVMALG 

GCG CTG CTG TCG GCG CTG CCC GAG TTC CTG ACC CAC CAG TAC 588 

A L L S "A L P E F L T H O Y 



20 



AAG TAC GAG GCG GGC GAG ATC CGC TGG GGC GCC GAG GGC CGC 
KYEA GEIRW G A EGR 



630 



GAC GTC TGC GCA GCC AAC GGC TCG GGC GGC GAC GAG GGG CCC 
D VCAAW GSGGDEGP 



672 



25 GAC CCC GAC CTC • ATC TGC CGC AAC CCG ACG GCT ACC AAC ATG 
D PDLICRN RTATN M 



714 



30 



atg tac ttg ctg ctc att ggg gcc cag gtg ctc ctg ggc atc 756 
mylllig aqv llgi. 

ggt gct acc cct gtg cag ccc ctg ggc gtc tcc tac atc gac 798 

gatpvqpl'gv.syxd 



35 



GAC CAC GTG CGG AGG AAG GAC TCC TCG CTC TAT ATA GGA ATC 

D H V R ?. K D S S L Y I G I 



840 



CTG TTC ACG ATG CTG GTA TTT" GGA CCA GCC TGC GGG TTT ATC 
L F T M L V r G ? A C G ? I 



862 



40 CTG GGC TCT TTC TGT ACC AAA ATC TAC GTG GAT GCG GTC TTC 

LGSFCTKI.YVDAVF 



924 



45 



ATT GAC ACA ACT AAC CTG GAC ATC ACT CCG GAC GAC CCC CGC 966 

IDT'SNLDITPDDPR. 

TGG ATC GGA GCC TGG TGG GGT GGC TTT CTG CTC TGC GGT GCC 1008 
W I G A W W G G F L L C G A 



50 



TTA CTC TTC TTC TCT TCC CTC TTG ATG TTT GGG TTT CCA CAG 
L L F F 5 S L L M F G F P Q 



1050 



TCC CTG CCC CCG CAC TCA GAC CCC GCC ATG GAA AGC GAG CAG 
SLPPHS DPAMES EQ 



1092 



55 GCC .ATG CTC TCC GAA AGA GAA TAC GAG AGA CCC AAG CCC AGC- 
A M L S S ?. E Y E R • P K P S 
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AAC GGG GTC CTG AGG CAC CCC CTG GAG CCA GAC AGC AGT GCC 
NGVLP. KPLEPD55A 

5 TCC TGT TTC CAG CAG CTG AGA GTG ATC CCG AAG GTC ACC AAG 

SC FQQLRVI PKVTK 

CAC CTG CTC TCA AAC CCT GTG TTC ACC TGC ATC ATC CTG GCC 

H L L S M PVFTCIILA 

10 

GCC TGC ATG GAG ATT GCA GTG GTG GCT GGC TTC GCT GCC TTT 

ACMZI AVVAGFAAF 

TTG GGG AAG TAC CTG GAG CAG CAG TTT AAC CTC ACC ACC TCT. 
15 LGKYLEQQFNLTTS 

TCT GCC AAC CAG CTG CTT GGG ATG ACT GCG ATC CCG TGT GCT 
SANQ LLGMTAIPCA 

20 TGT CTG GGT ATC TTC CTG GGA GGT CTT TTG GTG AAG AAG CTC 
CLGIFLGGLLVKKL 

AGC CTG TCT GCC CTG GGG GCC ATT CGG ATG GCC ATG CTC GTC 
S L SALG 'A I RMAMLV 

25 

AAC CTG GTG TCC ACT GCT TGC TAC GTC TCC TTC CTC TTC CTG 
N LVSTACYVSFLFL 

GGC TGC GAC ACT GGC CCT GTG GCT GGG GTT ACT GTT CCC TAT 
30 GCDTGPV AGVTVPY 

GGA AAC AGC ACA GCA CCT GGC TCA GCC CTG GAC CCC TAC TCG 
GNS TAPGSALDPYS 

35 CCC TGC AAT AAT AAC TGT GAA TGC CAA ACC GAT TCC TTC ACT 

P C N ?I N C E C Q T • D S F T 

CCA GTG TGT GGG GCA GAT GGC ATC ACC TAC CTG TCT GCC TGC 
PVCGADG I TYLSAC 

40 

TTT GCT GGC TGC AAC AGC ACG AAT CTC ACG GGC TGT GCG TGC 
FAGCNSTNLTGCAC 

CTC ACC ACC GTC CCT GCT GAG AAC GCA ACC GTG GTT CCT GGA 
45 LTTVPAENATVVPG 

AAA TGC CCC AGT CCT GGG TGC CAA GAG GCC TTC CTC ACT TTC 

KCPSPGC QEAFLTF 

50 CTC TGT GTG ATG TGT ATC TGC AGC CTG ATC GGT GCC ATG GCA 
LCVMC ICSLIGAM 'A 

CAG ACA CCC TCA GTC ATC ATC CTC ATC AGG ACA GTC AGC CCT 
QTPSVI-ILIRTVSP 

55 

GAA CTC AAG TCT TAC GCT TTG GGA GTT CTT TTT CTC CTC CTT 



PCT/USOO/13939 

1176 
1218 
1260 
1302 
1344 
1386 
1428 
1470 
1512 
1554 
1596 
1638 
1680 
1722 
1764 
1806 
1848 
1890 
1932 
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E 



G 



V 



L 



r 



L 



CGT TTG TTG GGC TTC ATC CCT CCA CCC CTC ATC TTC GGG GCT 1974 
RL LG rl P?P- LIFGA 

5 • 

GGC ATC GAC TCC ACC TGC CTG TTC TGG AGC ACG TTC TGT GGG • 2 016 
GID STCLFWSTFCG 

GAG CAA GGC GCC TGC GTC CTC TAC GAC AAT GTG GTC TAC CGA 2 OS 8 
10 E Q GAC V L Y D N V V Y R 

TAC CTG TAT GTC AGC ATC GCC ATC GCG CTC AAA TCC TTC GCC 2100 
YLYVS I AIALKS FA 

15 TTC ATC CTG TAC ACC ACC ACG TGG CAG TGC CTG AGG AAA AAC 2142 
FILYTTTWQC LRKN. 

TAT AAA CGC TAC ATC AAA AAC CAC GAG GGC GGG CTG AGC ACC 2184 

YKRY I.KNHEGGLST 

20 

AGT GAG TTC TTT GCC TCT ACT CTG ACC CTA GAC AAC CTG GGG 2226 
S E F FAS TLTLDN LG 

AGG GAC CCT GTG CCC GCA AAC CAG ACA CAT AGG ACA AAG TTT 2268 
25 RDPV PANQ THRTKF- 

ATC TAT AAC CTG GAA GAC CAT GAG TGG TGT GAA AAC ATG GAG 2310 
IY NLEDHEWCENME 

30 TCC GTT TTA TAG TGACTAAAGG AGGGCTGAAC TCTGTATTAG TAATCCAAGG 2362 
S V L 

GTCATTTTTT TCTTAAAAAA AGAAAAAAAG GTTCCAAAAA AAACCAAAAC 2412 

TCAGTACACA CACACAGGCA CAG ATG C ACA CACACGCAGA CAGACACACC 24 62 . 

35 GACTTTGTCC TTTTTCTCAG CATCAGAGCC AGACAGGATT CAGAATAAGG 2512 

AGAGAATGAC ATCGTGCGGC AGGGTCCTGG AGGCCACTCG CGCGGCTGGG 2562 

CCACAGAGTC TACTTTGAAG GCACCTCATG GTTTTCAGGA TGCTGACAGC 2612 

TGCAAGCAAC AGGCACTGCC AAATTCAGGG AACAGTGGTG GCCAGCTTGG 2662 

AGGATGGACA TTTCTGGATA CACATACACA TACAAAACAG AAAACATTTT 2712 

40 TTAAAAGAAG TTTCCTAAAA TAAAAAAAAT AAAAAAAAAA AAAAA 2757 

OATP-RP4 (SEQ ID NOS:7 and 8) (Nucleotide 713, designated Y, can be either a C 
(in which case the encoded amino acid X is Leu) or a T (in which case the encoded 
amino acid X is Phe); Nucleotide 2397, designated K, can be either a G (in which case 
45 the encoded amino acid X is Gly) or a T (in which case the encoded amino acid X is 



Val)): 



50 



CTGATTTCTC TTCGGCTGGA CGGAGGCTGC CTCCTCACGC GGCTCCCAAC 

TATTCCCGTA GCTCAGTGCC CCCCTCCCGC CGCTCTACTC AG CC AGG CAG 

ACAGACTGAC AGACTCGCTA GTCGGCAGCT TCACTCCCGA GGGTGCCGCG 

AGCCCAGGCG GCGAACACCC GGTACCCCTG GCGCAGCGAG GTGGGATGCT 



50 
100 
150 
200 
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10 



GTACGGACAG CAGCGCTAAG TCCCCCCCCA CCCCCGGCGC AGGGTGCACT 250 

CGCTCCTGGC CGCGGGCCCA GCGGCGGCGG CGGCGGCGGC GGCGGAGGGG 3 00 

ATGAGCCCGG GACGCGCGAG GCGCCTGCCT CAAGCTACCG CCCGGAGAGG 3 50 

GACGCCGAGT AGGGCTCATC GCAGTACCGC GCGGACCCCT GCCCCCTGTG 400 

GCACGCGGCT GCGGAGCCTT GAAGCCGTGT CTGTGATCAG GATGCACTGG 4 50 

GCGCCTCGCA GCTGGTGAGG ATGCCCTGCT GCGCGGCCCT GCGCCCCCAG 500 

CCCCAGTCCC AGGTGGGCAA GACTGACTGG GCCCGGCTTC GGCCCCTCGT 550 

GCCGGTGGAT GAAACGTGCC GGAGTGCTTG GGTGCCATCA GCTATCAAAT 600 

CTGAATTCTA AGCGCC ATG GAC GAA GGC ACT GGA CTG CAG CCC GGG 64 6 
MDEGTGLQPG 



GCG GGA GAG CAG CTG GAG GCG CCG GCC ACT GCA GAA GCT GTC 
AG E-QLEAPATAEAV 



68B 



15 CAA GAG AGG TGC GAG CCG GAG ACC YTC AGG TCT AAG AGT TTA 
QE RCEPETXRS KSL 



730 



20 



CCG GTC CTC AGC AGC GCC TCC TGC CGG CCA AGC CTC AGT CCC 772 
PVLSSASCRPSLSP 

ACT AGT GGA GAC GCC AAC CCG GCC TTT GGC TGT GTG GAT TCT B14 

TSGDANPAFG-CVDS 



25 



TCG GGC CAC CAG GAG TTG AAG CAA GGC CCG AAC CCG TTG GCC 
SGHQELK QGP NPLA 



856 



CCC AGT CCC TCT GCC CCG TCC ACT TCG GCG GGG CTC GGG GAC 
PS PSAPSTSAGLGD 



898 



30 TGT AAC CAC AGG GTG GAC CTC AGC AAA ACC TTC TCG GTG TCC 
CNHRVDLSKTFSVS 



940 



35 



TCC GCC TTG GCC ATG CTC CAG GAG AGA AGG TGC CTC TAC GTG 982 
SAL AMLQERRCLYV 

GTC CTC ACG GAT TCC CGT TGC TTC CTG GTG TGC ATG TGC TTT 1024 
VLTDSRCFLVCMC ? 



40 



CTG ACC TTC ATC CAG GCG TTA ATG GTC TCT GGG TAC CTG AGC 
LTFIOALM VSGYLS 



1066 



AGC 

S 



GTA ATT ACC ACC ATT GAA AGG CGC TAC AGT 
VITTIERRYS 



CTG AAG 

h K 



AGT 

S 



1108 



45 TCC GAG TCG GGG CTG CTG GTC AGC TGC TTT GAC ATC GGG AAC. 
SESGLLVSCFDIGN 



1150 



50 



CTG GTG GTG GTG GTG TTC GTC AGC TAC TTC GGC GGC CGG GGT 1192 

LVVVVFVSYFGGRG 

CGG CGG CCC CTG TGG CTG GCC GTG GGT GGA CTC CTC ATC GCC 1234 
RRPLWLAVGGLLI A 



55 



TTC GGG GCA GCC CTC TTC GCC TTA CCT CAC TTC ATC TCG CCC 

FGAALFALPHF IS P 



1276 
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CCC TAC CAG ATC CAA GAG TTG AAC GCC TCG GCC CCC AAC GAC 1318 
p y Q I 0 Z L K A S . A ? II D 

GGC CTG TGT CAG GG7 GGC AAC TCC ACC GCC ACT TTG GAG CCT 1360 

5 GLCQGGNS TAT L EP 

CCG GCC TGT CCG AAG GAC TCG GGA GGA AAT AAT CAC TGG GTC 14 02 . 

PA C ? K 0 5 G G M H H W 

10 TAC CTG GCT TTA'TTC ATT TGC GCG CAG ATT CTC ATT GGA ATG 1444 
y L A L F I C A Q I L I G M 

GGC TCC ACA CCT ATT TAT ACC CTG GGA CCA ACC TAC TTA GAT 14 86 

GSTPIYT-LGPTYLD 

15 

GAC AAT GTC AAG AAA GAA AAC TCC TCC TTG TAC CTA GCC ATC - 1528 
DNVK KENSSLYL AI 

ATG TAT GTC ATG GGA GCA CTT GGC CCT GCA GTG GGA TAT TTA 1570 
20 M Y V M G A L G F A V G V L 

TTA GGT GGA CTT CTT ATT GGT TTT TAT GTT GAT CCC AG A AAT 1612 

L G G h L I . G F Y V D P R N 

25 CCT GTT CAC CTT GAC CAG AAT GAC CCT CGT TTC ATT GGA AAC i654 
p V H L D Q N D P R F I G. N 

TGG TGG AGT GGA TTC CTC CTT TGT GCC ATT GCA ATG TTT CTT 1696 
WWSGFLLCAI AMFL 

30 

GTG ATA TTC CCA ATG TTT ACT TTC CCA AAA AAG CTT CCA CCT 173 8 

V.I F P M FT F P K KL F P 

CGA CAC AAG AAA AAG AAA AAG AAA AAA TTT TCT GTT GAT GCT 1780 . 

35 ' R H K K K K K K K F S V D A 

GTT AGT GAT GAC GAT GTT CTG AAG GAG AAA TCA AAC AAC AGT 1822 
V S D D D V L K E K S N N S 

40 GAA CAA GCG GAC AAA AAA GTT TCT TCG ATG GGA TTT GGA AAG 1864 
EQADK KVSSM G FGK' 

GAT GTC AGA GAC CTA CCA AGA GCA GCT GTC AGG ATC TTA AGC 1906 
DVRDLPRAAVRIL'S 

45 

AAC ATG ACA TTC CTT TTT GTG AGT TTG TCA TAC ACA GCT GAG 194 8 
NMTFLFVSLSYT AE 

AGT GCC ATT GTA ACT GCT TTC ATT ACC TTC ATT CCC AAG TTC 1990 
50 SAIVTAFITFIPK .F 

• ATC GAG TCA CAG TTT GGT ATC CCA GCC TCC AAT GCC AGC ATC 2032 
I E S 0 F G I PAS N A E I 

55 TAC ACT GGG GTT ATT ATC GTC CCC AGT GCT GGT GTT GGT ATT 2074 

y 7 . G V I I V • P S ■ A G 7 G I 

27 ' 
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40 
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GTC CTC GGA GGC TAC ATT ATA AAA AAA TTG AAA CTT GGT GCC 2116 
V L G G Y I IK KL KLGA 

AGA GAA TCT GCA AAA CTA GCA ATG ATC TGC AGT GGT GTG TCT 2158 

HE SAKIiAM I CSGVS- 

TTA CTA TGT TTT TCA ACC CTA TTT ATT GTT GGA TGT GAA AGC 2200 

LLCFSTLFIVGCES 

ATT AAT CTA GGG GGC ATA AAC ATC CCT TAT ACA ACA GGA CCT 2242 
INLGGINIPYTTG P 



TCT CTC ACC ATG CCC CAT AGG AAT CTG ACA GGA AGC TGC AAC 2264 
15 S L T M P H R N L T G S C N 

GTT AAT TGT GGT TGT AAA ATA CAC GAG TAT GAG CCA GTC TGT 2326 

VN.CGCKIHEYEPVC 

20 GGA TCA GAT GGA ATT ACA TAC TTT AAC CCT TGT CTG GCT GGC 23 68 
GSD GITYFNPCLA G 

TGT GTT AAT AGT GGT AAT CTT AGC ACT GKG ATA CGG AAT TAT 2410 
CVNSGNLSTXIRNY 



ACA GAA TGC ACC TGT GTC CAA AGT CGC CAA GTG ATC ACT CCA 2452 
TECTCVQSRQVITP 



CCC ACC GTG GGA CAG CGA AGT CAG CTC CGT GTG GTT ATT GTC 24 94 
30 PTVGQRSQLRV-VIV 

AAG ACT TAT CTC AAT GAG AAC GGC TAT GCT GTG TCT GGG AAA 2536 
KTYLNENGYAVSGK 

I 

35 TGT AAA CGG ACC TGC AAT ACT CTT ATC CCA TTC TTA GTT TTT 257 8 
CKRTCNT LI PFLVF 

CTT TTC ATA GTC ACC TTC ATC ACA GCA TGT GCC CAA CCA TCA 2620 
LFIVTFITACAQPS 



GCT ATC ATA GTA ACA CTC AGG TCC GTA GAA GAT GAG GAG AGA 2662 
AI IVTLRSVEDEER 



cct ttt gca ctg gga atg cag ttt gtt ttg ttg cga aca ctt 2704 
45 pfa lgmqfvllrtl 

gca tac att cct act cca atc tac ttt gga gca gtc att gac 274 6 

a yiptpiyfgavid 

50 acc acc tgc atg ctc tgg caa cag gaa tgt ggt gtg cag ggt 2788 

ttcmlwqqec'gvqg 

tct tgc tgg gag tac aac gtg acg tcg ttt cgt ttt gtg tat 2 83 0 
scweynvts frfvy 

55 

TTT GGT TTG GCT GCC GGC CTC AAA TTC GTT GGG TTT ATT TTT 2 872 
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50 



F G 




A 


A 


G 














T 
A 


F 




ATT TTT 


CTG GCC 


TGG 


TAC 


TCC 


ATA , 


AAA 


TAC 


AAG 


GAG 


GAT 


GGA 


2914 


I F 




A 


W 


V 


s 


I 


K 


V 


K 


£ 


D 


G 




CTG CAG 


AGG CGG 


AGG 


CAG 


AGA 


GAA ' 


TTT 


CCC 


CTG 


AGC 


ACC 


GTG 


2956 


L. O 
u v 


F. 


R 


R 


o 


p. 




c 


? 




s 


T 


V 




AGT GAG 


AGA GTG 


GGA 


CAC 


CCC 


GAC . 


AAT 


GCC 


CGG 


ACT 


AGA 


TCT 


299B 


O fit 


R 


V 






p 




U 




R 


T 


R 


s 




TGC CCA 


GCT TTC 


AGC 


ACC 


CAG 


GGA i 


GAA 


TTC 


CAC 


GAA 


GAG 


ACT 


3040 


C P 


A 


F 


S 


T 


Q 


G 


E 


F 


H 


£ 


E 


T 




GGC CTG 


CAA AAA 


GGG 


ATC 


CAG 


TGC 1 


GCA 


GCA 


CAG 


ACC 


TAC 


CCG 


3082 


G L 


0 


K 


G 


T 


0 


C 


A 


• A 


Q 


T 




P 




GGG CCC 


TTC CCA 


GAA 


GCA 


ATA 


AGT ' 


rcc 


TCT 


GCG 


GAC 


CCG 


GGG 


3124 


G P 


F 


P 


E 


A 


T 


S 


s 


S 




D 


P 


G 




CTG GAA 


GAG AGC 


CCC 


GCT 


GCC 


TTG 1 


GAG 


CCG 


CCC 


TCC 


TGA 




3163 


L E 


E 


S 


? 


A 


A 


L 




P 


? 


S 








AGCTTGAAAA 


TGGAAGAATT TAGTTTTGTT 


GGTTGAATTG 


AAAATGGCGA 


3213 


CTTGAGAAAC 


AACTGTGCCT TCTTTTCTTT 


CTTTCTTTTT 


TTTAACCTCT 


3263 


ACAGACACAA 


TCCTCAAACC AACAAAACTC 


AGTATACACA 


GCCGCTATTC 


3313 


ATTGAGGGCT 


GGATAC CTCA ACAAGACTGA 


GAGCCTTTCC 


CCGCTTCTCT 


3363 


CCAAGAAGGA 


GACGTTCAGC TAGATTTGTT 


CCCATTTCCG 


TTGTGTTAAT 


3413 


TCAAAGCTCA 


TGCTCCCCTA CGGTACAGGC 


TGAGGTACAC 


GGTTAGCAAA 


3463 


ACCATGGGAA 


GGGGAATGGC GGTGCATATC 


ATTAACTAAC 


ACTCCAAACA 


3513 


AAGGTGAGCT 


TGCCCAGGAC TTGGCATTTC 


CAAATCAAAG 


TTTTTAGATA 


3563 


TGAACACCTA 


CTGTGAGTTC TGCTACAAAG 


CACAAATGAA 


TTTGTCTCAA 


3613 


CTATGCAATT 


TGATTGGAAA AATGTATGTG 


CAGCATGTTA 


CATTTACTTT 


3663 


CACGGAATAA 


AGCAGATATG TTTCTGAAA 














3692 



10 



15 



20 



25 



30 



35 



OATP-RP5 (SEQ ID NOS:9 and 10): 

CGCAAAGAAA TGG CTCAAAA GCTTCAGCTC TTT CTG TGC C CTGGGAGCTG 50 

AGATGCACGT CAGTGGCCTT GCCAGCGTGG CCAATTCTCT GCTGACTGCC 100 

40 AGAAAAAAGA G GCC AGG AAG AAAGAGGAAA GAGAAGAGAT CGCTCAGGGG ISO 

TGAGACCATG CCCTTCATCT TTTCTTTTCC CTAATCTCCT CTGCTTGTGT 200 

CCACCCACAC TCTCCCCACC TGG C AAAATT GTTCAAAATT GCTGTGGAGT 250 

TTACCTCAGT TTCCTCTTTC AGTCTGTGGT GTGTGGTCCA TCCTCTTGCT 3 00 

GAGCACATTG AAAGGAACTG GCTATCTTTG ATCTCTTCCT CCAGATCAGA 350 

45 GTCAAGGAAT GTGTTTATA ATG GAC ACT TCA TCC AAA GAA AAT ATC 3 96 

MDTSSKEN I 

CAG TTG TTC TGC AAA ACT TCA GTG CAA CCT GTT GGA AGG CCT 4 38 
QLFCKTSVQ'PVGR? 



TCT TTT AAA ACA GAA TAT CCC TCC TCA GAA GAA AAG CAA CCA 4 80 

SFKTSY'P'SSEEKQ? 

TGC TGT GGT GAA CTA AAG GTG TTC TTG TGT GCC TTG TCT TTT 522 
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c c g e i :•: v f 

GTT TAC TTT GCC AAA GCA TTG GCA 

v y ? a :-; ALA 

5 

ACC ATC ACT CAG ATA GAG AGA AGG 
TITQIERR 

CTG GTG GGA GTT ATT GAT GGT AGT 
10 LVG VIDGS 

TTA GTT ATA ACA TTT GTT AGC TAC 
LVITFVSY 

15 AGG CCA AAA ATA ATT GGA GCA GGG 
RPKIIGAG 

GGA ACA CTG CTC ATT GCA ATG CCT 
GTLLIAMP 

20 

TAC AAA TAT GAG AGA TAT TCT CCT 

Y K Y Z R Y S P 

AGC ATC TCT CCG TGT CTC CTA GAG 
25 SISPCLLE 

GTT TCA GTT ATG GAA AAA TCA AAA 
VSVMEKS K 

30 TGT GAA GTG GAC ACT AGC TCT TCC 
CEVDTSSS 

CTG GGC AAT CTT CTT CGT GGA ATA 
LG NLLRGI 

35 

CCT TTG GGC ATT GCC TAC CTG GAT 
PLC IAYLD 

AAT GCA GCT TTC TAT ATT GGG TGT 
40 N A A F Y I G C 

ATA GGA CCA ATC TTT GGT TTC CTG 
IGPIFGFL 

45 AAA CTA TAT GTT GAC ATT GGC TTT 

K L Y V D I G F 

ACC ATT ACC CCA AAA GAT CCC CAG 
TITPKDPQ 

50 

CTT GGC TAT CTA ATA GCA GGA ATC 
LGYLIAGI 

GTG CCT TTC TGG TAT TTA CCA AAG 

55 V ? F. W Y L P K 
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L C A 1 S ? 

GAA GGC TAT CTG AAG AGC '564 

E G Y L K S 

TTT GAT ATC CCT TCT TCA 606 

F D I ? S S 

TTT GAA -ATT GGG AAT CTC 64 8 

F E I G W . L 

TTT GGA GCC AAA CTT CAC 690 

F G A K L H 

TGT GTA ATC ATG GGA GTT 732 

C V I M G V 

CAG TTC TTC ATG GAG CAG 774 

Q F F M E Q 

TCC TCC AAT TCC ACT CTC 816 

S S N S T L 

TCA AGC AGT CAA TTA CCA 858 
S S S 0 L P 

TCC AAA ATA AGT AAC GAA 900 

S K I S N E 

ATG TGG ATT TAT GTT TTC 942 

M W I Y V F 

GGA GAA ACT CCC ATT CAG 984 

G E T i ? I Q 

GAT TTT GCC AGT GAA GAC 1026 . 
D F A S E D 

GTG CAG ACG GTT GCA ATT 1068 

V Q T V A I 

TTA GGC TCA TTA TGT GCC 1110 
L G S L C A 

GTA AAC CTA GAT CAC ATA 1152 

V N L " D H I 

TGG GTA GGA GCC TGG TGG 1194 

W VGA W W 

ATA AGT CTT CTT GCA GCT 1236 

I S L • L A A 

AGT TTA CCA AGA TCC CAA 1278 

S L ? R S Q 
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AGT AG A GAG GAT TCT AAT TCT 
S R ZDS 11 S 

ATT ATA GAT GAT CAC ACA GAC 

5 I I D D H T D 

AAT GCA AAA ATA ATG GAA ATG 

N A K I l'i Z M 

10 CTG AAG AAT CTT TTT GGA AAC 

L K N L F G N 

TGT ACA AGC ACT GTT CAG TTC 

C T S T V 0 F 

15 

ACG TAC AAA CCA AAG TAC ATT 

T Y K F K Y I 

TCC TCC AGG GCC AAC TTT GTG 
20 S S RAN F V 

GCA GTG GCC CTT GGA ATA TTC 

A V A L GIF 

25 AAA TTC AGA ATC AGT GTG TGT 

K F R I S V C 

GGA TCA TCT GTC TTT GGT TAC 

G S S V F G Y 

30 

GCA CTG GGC TGT GAA AAT TCT 

A L G C E N S 



35 TCC TAC CAA GGA ACC AAA CCT 
S Y Q G T K ? 

CTC TTT TCA GAT TGC AAC TCA 
L F S D C N S 

40 

AAA TGG GAA CCC ATG TGC GGT 
K W E P M C G 

TCA GCT TGT CTT GCT GGT TGT 
45 SACLAGC 

AAA AAT ATT ATA TTT TAC AAC 

K N I I F Y N 

50 GCT TCT AAA TCC GGA AAT TCC 

AS K S G N 5 

CAG AAA GAC AAT GGA TGT CCC 
Q K D N G C ? 

55 

GTA ATT TCA GTC ATC ACA TCC 




TCC TCT GAG AAA TCC AAG TTT 

S S Z K S K F 

TAC CAA ACA CCC CAG GGA GAA 

Y 0 T P Q G. E 

GCA AGA .GAT TTT CTT CCA TCA 

A R D F L P S 

CCA GTA TAC TTC CTA TAT TTA 

P V Y F L Y L 

AAT TCT CTG TTC GGC ATG GTG 

H S Li F G M V " 

GAG CAG CAG TAT GGA CAG TCA 

E Q 0 Y G O 3 

ATC GGG CTC ATC AAC ATT CCA 

I G L I N I P 

TCT GGG GGG ATA GTT ATG AAA 

S G G I V M K 

GGA GCT GCA AAA CTC TAC TTG 

G A A K L Y L 

CTC CTA TTT CTT TCC CTG TTT 

L L F L S L F 

GAT GTG GCA GGA CTA ACT GTC 

D V A G L T V 



GTC TCT TAT CAT GAA CGA GCT 

V S Y H E R A 

AGA TGC AAA TGT TCA GAG ACA 
R C K C S E T 

GAA AAT GGA ATC ACA TAT GTA 
E N G I T Y V 

CAA ACC TCC AAC AGG AGT GGA 
Q T S N R S G 

TGC ACT TGT GTG GGA ATT GCA 
C T C V G I A 

TCA GGC ATA GTG GGA AGA TGT 
S G I V G R C 

CAA ATG TTT CTG TAT TTC CTT 
Q M F L Y F L 

TAT ACT TTA TCC CTA GGT GGC 



PCT/US00/13939 

1320 

1362 
1404 
1446 
I486 
1530 
1S72 
1614 
1656 
1698 
1740 

1782 
1824 
1666 
1908 
1950 
1992 
2034 
2076 
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ATA CCT GGA TAC ATA TTA CTT CTG AGG TGC ATT AAG CCA CAG 
IPGVILLLRCIKPQ 



2118 



CTT AAG TCT TTT GCC TTG GGT ATC TAC ACA TTA GCA ATA AGA 
LKSFALG IYTLA I R 



2160 



GTT CTT GCA GGA ATC CCA GCT CCA GTG TAT TTT GGA GTT TTG 
10 VLAG I PAPVYFGVL 



2202 



ATT GAT ACT TCA TGC CTC AAA TGG GGA TTT AAA AGA TGT GGA 

ID.T SCLKWGFKRCG 



2244 



15 AGT AGA GGA TCA TGC AGA TTA TAT GAT TCA AAT GTC TTC AGA 
SRGSC RLYDSNVFR 



2286 



20 



CAT ATA TAT TTG GGA CTA ACT GTG ATA CTG GGC ACA GTG TCA 2328 
HIYLGLTVIL GTVS 

ATT CTC CTA AGC ATT GCA GTA CTT TTC ATT TTA AAG AAA AAT 2370 

ILLS IAVXjFILKKN 



25 



TAT GTT TCA AAA CAC AGA AGT TTT ATA ACC AAG AGA GAA AGA 

YVSKHRSFITKRER 



2412 



ACA ATG GTG TCT ACA AGA TTC CAA AAG GAA AAT TAC ACT ACA 
TMVS TRFQKENYTT 



2454 



30 AGT GAT CAT CTG CTA CAA CCC AAC TAC TGG CCA GGC AAG GAA 
SDHLLQPNYWPGKE 



2496 



35 



40 



45 



50 



ACT CAA CTT TAG AAACATGATG ACTGGAAGTC ATGTCTTCTA 
T Q L 

ATTGGTTGAC ATTTTGCAAA CAAATAAATT GTAATCAAAA G AG CTCTAAA 
TTTGTAATTT CTTTCTCCTT TCAAAAAATG TCTACTTTGT TTTGGTCCTA 
GGCATTAGGT AAT AT AAC TG ATAATATACT GAAATATATA ATGGAAGATG 
CAGATGATAA AACTAATTTT GAACTTTTTA ATTTATATAA ATTATTTTAT 
ATCATTTACT TATTTCACTT TATTTTG CTT TGTGCTCATT GATATATATT 
AGCTGTACTC CTAGAAGAAC AATTGTCTCT ATTGTCACAC ATGGTTATAT 
TTAAAGTAAT TTCTGAACTG TGTAATGTGT CTAGAGTAAG CAAATACTGC 
TAACAATTAA CTC AT AC CTT GGGTTCCTTC AAGTATTACT CCTATAGTAT 
TTTCTCCCAT AGCTGTCTTC ATCTGTGTAT TTTAATAATG ATCTT AG GAT 
G GAG CAG AAC ATG GAG AGG A AGATTTCATT TTAAGCTCCT CCTTTTCCTT 
GAAATACAAT AATTTATATA GAAATGTGTA GCAGCAAATT ATATTGGGGA 
TTAGAATTTT GAATTAATAG CTCTCCTACT ATTAATTTAC ATGTGCTTTT 
TGTGTGGCGC TAT AAG TG AC TATGGTTGTA AAG T AATAAA ATTGATGTTA 

TTTATGAATT CAATGAATTT AAAACTATTG 
CTTTAATATA TGTAAGCAAC TTCCTACTTA 
CATGTTTGAA AGG TG AATTT CTG AAAG TCT 
AACAGGAAAA AAAAAAAAAA AAA 



ACATGCCCAA TTATTGTTCT 
TTAAATATAA TACTGCCCCA 
TACACGACGT GTTCCTAAAA 
CCCATAAATG TAGGTGTTAC 



2538 



2588 
2638 
2688 
2738 
2788 
2838 
2888 
2 938 
2988 
3038 
3088 
3138 
3188 
3238 
3288 
3338 
3381 
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OATP-RP1 (SEO ID NOS:l 1 and 12): 

GGCACGAG GCGCTGCGCG 18 
GCGCGGCGGC CGGGCCCTCG AGACGGGGAC GGACACACCA GCCCCTCGGA 68 

TACCACTTGG CCACTCCCGC TGAGGCCACT CCCACTGCGT GGCTGAAGCC 118 

5 TCGAGGTCAC CAGGCGGAGG CGCGGAG ATG CCC CTG CAT CAG CTG GGG 166 

M P L H Q L G 

GAC AAG CCG CTC ACC TTC CCC AGC CCC AAC TCA GCC ATG GAA 2 08 
DKPLTFPSPNSAME 

!0 

AAC GGG CTT GAC CAC ACC CCA CCC AGC AGG AGG GCA TCC CCG 250 
NG LDHTPPSRRAS P 

GGC ACA CCC CTG AGC CCC GGG TCC CTC CGC TCC GCT GCC CAT 2 92 
15 GTPLSPGSLRSAAH 

AGC CCC CTG GAC ACC AGC AAG CAG CCC CTC TGC CAG CTC TGG 3 34 
SPLDTSKQPLCQLW 

20 GCC GAG AAG CAT GGC GCC CGG GGG ACC CAT GAG GTG CGG TAC 3 76 
AEKHGARGTH EVRY 

GTC TCG GCC GGG CAG AGC GTG GCG TGC GGC TGG TGG GCC TTC 418 
VSAGQ SVACGWWAF 

25 

GCA CCG CCG TGC CTG CAG GTC CTC AAC ACG CCC AAG GGC ATC 4 60 
APPCLQVLNTPKG I 

CTG TTC TTC CTG TGT GCG GCC - GCA TTC CTG CAG GGG ATG ACT 502 
30 LFFLCAA AFLQGMT 

GTG AAT GGC TTC ATC AAC ACA GTC ATC ACC TCC CTG GAG CGC 544 
VNGFINTVITSLER 

35 CGC TAT GAC CTG CAC AGC TAC CAG AGC GGG CTC ATC GCC AGC 586 
RYDLHSYQSGLIAS 



TCC TAC GAC ATT GCC GCC TGC CTC TGC CTC ACC TTC GTC AGC 628 
40 SYDIAACLCLT FVS 

TAC TTC GGG GGC TCA GGG CAC AAG CCG CGC TGG CTG GGC TGG 670 
YFGGSGHKPRWLGW 

45 GGC GTG CTG CTT ATG GGC ACG GGG TCG CTG GTG TTC GCG CTG 712 
GVLLMGTGSLVFAL 

CCC CAC TTC ACG GCT GGC CGC TAT GAG GTG GAG TTG GAC GCG 7 54 
PH FTAGRYEVELDA 

50 

GGT GTC AGG ACG TGC CCT GCC AAC CCC GGC GCG GTG TGT GCG 7 96 
GV RTC PAN PGAVC A 

GAC AGC ACC TCG GGC CTG TCC CGC TAC CAG CTG GTC TTC ATG 83 8 

55 D S T S G L S R Y Q L V F M 
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CTG GGC CAG TTC CTG CAT GGC GTG GGT GCC ACA CCC CTC TAC 880 
LG QFLHGVG A T ? L Y 

ACG CTG GGC GTC ACC TAC CTG GAT GAG AAC GTC AAG TCC AGC 922 
TLGVTVLDENV K S S 

TGC TCG CCC GTC TAC ATT GCC ATC TTC TAC ACA GCG GCC ATC 964 

CS P V V I A I FYTAAI 

CTG GGC CCA GCT GCC GGC TAC CTG ATT GGA GGT GCC CTG CTG 1006 

LG PAAGYL I G G A L L 



AAT ATC TAC ACG GAA ATG GGC CGA CGG ACG GAG CTC ACC ACC 104 8 
15 N I YTEMGRRTELTT 

GAG AGC CCA CTG TGG GTC GGC GCC TGG TGG GTC GGC TTC CTG 1090 
ES PLWVGAWWVGFL 

20 GGC TCT GGG GCC GCT GCT TTC TTC ACC GCC GTT CCC ATC CTT 1132 

GSG AAArFTAVPIL '; 

GGT TAC CCT CGG CAG CTG CCA GGC TCC CAG CGC TAC GCG GTC 1174 
GYPRQLPGSQRYAV 



ATG AGA GCG GCG GAA ATG CAC CAG TTG AAG GAC AGC AGC CGT 1216 
MRAAEMHQLKDSSR 



GGG GAG GCG AGC AAC CCG GAC TTT GGG AAA ACC ATC AGA GAC 1258 
30 GEASNPDFG KT I RD 

CTG CCT CTC TCC ATC TGG CTC CTG CTG AAG AAC CCC ACG TTC 13 00 
LPLSIWLLLKNPTF 

\ 

35 atc ctg ctc tgc ctg gcc ggg gcc acc gag gcc act ctc atc 13 42 
illclagat ea tl: 

acc ggc atg tcc acg ttc agc ccc aag ttc ttg gag tcc cag 13 84 
tgmstfs pkflesq 



TTC AGC CTG AGT GCC TCA GAA GCT GCC ACC TTG TTT GGG TAC 1426 
F SLSASEAATLFGY 



CTG GTG GTG CCA GCG GGT GGT GGC GGC ACC TTC CTG GGC GGC 14 68 

45 LVV-PAGGGG. TFLGG- 

TTC TTT GTG AAC AAG CTC AGG CTC CGG GGC TCC GCG GTC ATC 1510 

FFVNKLRLRGS AVI 

50 AAG TTC TGC CTG TTC TGC ACC GTT GTC AGC CTG CTG GGC ATC 15 52 
KFCLFCTVVSLL-GI 

CTC GTC TTC TCA CTG CAC TGC CCC AGT GTG CCC ATG GCG GGC 15 94 
LVFSLH CPSVPMAG 



GTC ACA GCC AGC TAC GGC GGG AGC CTC CTG CCC GAA GGC CAC 1636 
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G 



L 



? 



G 



10 



15 



20 



25 



30 



35 



40 



45 



50 



CTG AAC CTA ACG GCT CCC TGC AAC GCT GCC TGC AGC TGC CAG 1678 
L N LTAPCNAACSCQ 

CCA GAA CAC TAC AGC CC7 GTG TGC GGC TCG GAC GGC CTC ATG 1720 
PEHVSPVCGSDGLM 

TAC TTC TCA CTG TGC CAC GCA GGG TGC CCT GCA GCC ACG GAG 17 62 
YFSLCHAGCPAATE 

ACG AAT GTG . GAC GGC CAG AAG GTG TAC CGA GAC TGT AGC TGT 1804 

T N V - ' D G 0 KV Y R D C S C 

ATC CCT CAG AAT CTT TCC TCT GGT TTT GGC CAT GCC ACT GCA 184 6 

I p Q NLSSGFG H A T A . 

GGG AAA TGC ACT TCA ACT TGT CAG AGA AAG CCC CTC CTT CTG 1888 

GKCTSTCQRKPLLL 

GTT TTC ATA TTC GTT GTA ATT TTC TTT ACA TTC CTC AGC AGC 1930 

VFlrVVIFFTFLSS 

ATT CCT GCA CTA ACG GCA ACT CTA CGA TGT GTC CGT GAC CCT 1972 
I pALiTATLRCVRDP 

CAG AGA TCC TTT GCC CTG GGA ATC CAG TGG ATT GTA GTT AGA 2014 
Q R S FALGIQW IVVR 

ATA CTA GGG GGC ATC CCG GGG CCC ATC GCC TTC GGC TGG GTG 2056 
IL GGIPGPIAFG WV 

ATC GAC AAG GCC TGT CTG CTG TGG CAG GAC CAG TGT GGC CAG 2 098 
IDKACLL WQDQCGQ 

CAG GGC TCC TGC TTG GTG TAC CAG AAT TCG GCC ATG AGC CGC 214 0 
QGSCLVVQNSAMSR 

TAC ATA CTC ATC ATG GGG CTC CTG TAC AAG GTG CTG GGC GTC 2182 

Y-ILIMGLZ.YKVLGV 

CTC TTC TTT GCC ATA GCC TGC TTC TTA TAC AAG CCC CTG TCG 2224 
LFFAIA CFLY KPLS 

GAG TCT TCA GAT GGC CTG GAA ACT TGT CTG CCC AGC CAG TCC 2266 
ESS DGLHTCL ?SQS 

TCA GCC CCT GAC AGT GCC ACA GAT AGC CAG CTC CAG AGC AGC 2308 
SAP DSATDSQLQSS 

GTC TGA CCACCGCCCG CGCCCACCCG GCCACGGCGG GCACTCAGCA 2354 



TTTCCTGATG ACAGAACAGT G CCG TTG GGT GATGCAATCA CACGGGAACT 24 04 
TCTATTTGAC CT3CAACCTT CTACTTAACC TGTGGTTTAA AGTCGGCTGT 2454 
GACCTCCTGT CCCCAGAGCT GTACGGCCCT GCAGTGGGTG GGAGGAACTT 2504 



V 
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GCATAAATA7 ATATTTATGG ACACACAG77 TGCATCAGAA CGTGTTTATA 2554 

GAATGTGTTT TATACCCGAT CG7GTGTGG7 GTGCGTGAGG ACAAACTCCG 2 604 

CAGGGGCTGT GAATCCCACT GGGAGGGCGG CGGGCCTGCA GCCCGAGGAA 2654 

GGCTTGTGTG TCCTCAGTTA AAACTGTGCA TATCGAAATA TATTTTGTTA 27 04 

TTTAAGCCTG CGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2754 

AAAAAAAAA 2763 



Persons skilled in the an can also modify the nucleic acids coding for the 

10 OATPs of the present invention to prepare useful mutations. For example, one may 
modify the sequence to provide additional restriction endonuclease recognition sites 
in the nucleic acid. Such mutations may be silent or may change the amino acid 
encoded by the mutated codon. One can prepare these modified nucleic acids, for 
example, by mutating the nucleic acid coding for an OATP of the present invention to 

15 result in deletion, substitution, insertion, inversion or addition of one or more amino 
acids in the encoded polypeptide. For methods of site-directed mutagenesis, see 
Taylor, J. W. et al. (1985), Nucl. Acids Res. 13, 8749-64 and Kunkel, J. A. (1985), 
Proc. Natl. Acad. Sci. USA 82: 482-92. In addition, kits for site-directed mutagenesis 
are available from commercial vendors (e.g., BioRad Laboratories, Richmond, CA; 

20 Amersham Corp., Arlington Heights, IL). For disruption, deletion and truncation 
methods, see Savers, J. R. et al. (1988), Nucl. Acids Res. 16: 791-800. 

This invention also comprises modified nucleic acids, including (1) alternative 
splice exon variants; (2) allelic variants: and (3) chimeric proteins in which the fusion 
construct comprises an OATP or fragment thereof. Such modified nucleic acids can 

25 be obtained by persons of ordinary skill in the an when armed with the present 
disclosure. 

Expression vectors 

This invention further concerns expression vectors comprising a nucleotide 
sequence encoding an OATP of the present invention. Preferably, the expression 
30 vectors comprise all or a portion of the nucleic acid sequence as shown in SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9 ? or SEQ ID 
NO: 11; preferred is a nucleotide sequence encoding an OATP as shown above (i.e., 
the coding region). 

Expression vectors are usually plasmids. but the invention includes other 
35 vector forms that serve equivalent functions and become known in the art 
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subsequently hereto. A person skilled in the art might also stably integrate a sequence 
encoding an OATP into the chromosome of an appropriate host cell. 

Expression vectors typically contain regulatory elements capable of affecting 
expression of an OATP. These regulatory elements can be heterologous or native 

5 OATP elements. Typically, a vector contains an origin of replication, a promoter, and 
a transcription termination sequence. The vector may also include other regulatory 
sequences, including mRNA stability sequences, which provide for stability of the 
expression product; secretory leader sequences, which provide for secretion of the 
expression product; environmental feedback sequences, which allow expression of the 

10 structural gene to be modulated (e.g., by the presence or absence of nutrients or other 
inducers in the growth medium); marking sequences, which are capable of providing 
phenotypic selection in transformed host cells; restriction sites, which provide sites 
for cleavage by restriction endonucleases; and sequences which allow expression in 
various types of hosts, including prokaryotes, yeasts, fungi, plants and higher 

15 eukaryotes. 

An expression vector of this invention is at least capable of directing the 
replication, and preferably the expression, of the nucleic acids and protein of this 
invention. Suitable origins of replication include, for example, the Col El, the SV40 
viral, Epstein Ban* viral, and the Ml 3 origins of replication. Suitable.promoters 

20 include, for example, the cytomegalovirus promoter, the lacZ promoter, the gal 10 
promoter and the Autographa californica multiple nuclear polyhedrosis virus 
(AcMNPV) polyhedral promoter. Suitable termination sequences include, for 
example, the bovine growth hormone. SV40, lacZ and AcMNPV polyhedral 
poiyadenylation signals. Examples of selectable markers include neomycin, 

25 ampicillin, and hygromycin resistance and the like. 

Persons skilled in the an may insert DNA encoding An OATP of the present 
invention into several commercially available vectors. Examples include vectors 
compatible with mammalian cells, such as pcDNA3 or pCEP4; baculovirus vectors 
such as pBlueBac; prokaryotic vectors such as pcDNA2; and yeast vectors such as 

30 pYes2. For vector modification techniques, see Sambrook et al. ( 1 989), Molecular 
Cloning : A Laboratory Manual . Second Edition, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY. 
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Host cells 

This invention additionally concerns host cells containing an expression vector 
that comprises a sequence encoding an OATP, preferably the OATP2. OATP-RP2, 
OATP-RP3, OATP-RP4, OATP-RP5 or OATP-RP1 of the present invention. The 
5 host cells preferably contain an expression vector which comprises all or part of the 
DNA sequence having the nucleotide sequence substantially as shown in SEQ ID 
NO:l, SEQ ED NO:3. SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, or SEQ ID 
NO:l 1, particularly the coding regions thereof. Suitable host cells include both 
prokaryotic cells (e.g., E. coli strains KB 101, DH5a. XL1 Blue, Y1090 and JM101) 
10 and eukaryotic cells (e.g., Spodoptera frugiperda insect cells. CHO cells, COS-7 cells, 
HEK 293 cells, human skin fibroblasts, and S. cerevisiae cells). 

Persons skilled in the art may introduce expression vectors into host cells by . 
various methods known in the art. Exemplary methods are transfection by calcium 
phosphate precipitation, electroporation, liposomal fusion, nuclear injection, and viral 
15 or phage infection. One may then culture the host cell under conditions permitting 
expression of large amounts of OATP. 

One may identify such modified host cells by any of five general approaches: 

(a) DNA-DNA hybridization with probes complementary to the sequence 
encoding an OATP (Southern blotting). 
20 (b) detection of marker gene functions, such as thymidine kinase activity, 

resistance to antibiotics, and the like. A marker gene can be placed in the same 
plasmid as an OATP sequence under the regulation of the same or a different 
promoter. 

(c) detection of mRNA transcripts by hybridization assays (e.g., Northern 
25 blotting or a nuclease protection assay using a probe complementary to the RNA 

sequence). 

(d) immunodetection of gene expression (e.g., by Western blotting with 
antibody to OATP). 

(e) PCR with primers homologous to expression vector sequences or 

30 sequences encoding OATP. The PCR produces a DNA fragment of predicted length, 
indicating incorporation of the expression system in the host cell. 
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Persons skilled in ihe an may determine DNA sequences by various known 
methods. See, for example, the dideoxy chain termination method in Sanger et al. 
(1977), Proc. Natl . Acad , Sci. USA 74: 5463-7 and the Maxam-Gilben method in 
Maxam-Gilben (1977), Proc. Natl. Acad . Sci. USA 74: 560-4. 
5 One may use the host cells of this invention in a variety of ways that are now 

apparent. One may use the cells to screen for compounds that bind to or otherwise 
modulate or regulate the function of an OATP of the present invention; which would 
be useful for modulation, for example activation or inactivation. of OATP2, OATP- 
RP2 ? OATP-RP3, OATP-RP4, OATP-RP5 or OATP-RP1 activity; to study signal 

10 transduction mechanisms and protein-protein interactions; and to prepare OATP for 
the uses described below. 

Not all expression vectors and DNA regulatory sequences will function 
equally well to express the DNA sequences of this invention. Neither will all host 
cells function equally well with the same expression system. However, one of 

15 ordinary skill in the art may make a selection among expression vectors, DNA 
regulatory sequences, and host cells using the guidance provided herein without 
undue experimentation and without departing from the scope of the invention. 



20 amino acid sequences of OATPs of the present inventiqn. The inventors prefer 
polypeptides comprising all or a portion of the amino acid sequences shown as in 
SEQ ID NO:2 (OATP2), SEQ ID NO:4 (OATP-RP2), SEQ ID NO:6 (OATP-RP3), . 
SEQ ID NO:8 (OATP-RP4), SEQ ID NO: 10 (OATP-RP5) or SEQ ID NO: 12 (OATP- 
RP1). Where a portion of an OATP of the present invention is used, preferably the 

25 portion exhibits the same biological activity of the OATP from which the portion is 
derived. For example, and within the scope of the invention, are polypeptides that 
comprise all or a portion of OATP2, OATP-RP2, OATP-RP3, OATP-RP4, OATP- 
RP5 or OATP-RP1 that exhibit transport activity. The portions may contain one or 
more mutations so that the protein(s) fail(s) to exhibit transport activity, but that can 

30 be used to screen for compounds that will modulate or bind to the protein or portion 
thereof. 



Polypeptides 

This invention further concerns polypeptides comprising all or a portion of the 
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Persons having ordinary skill in the an may prepare these polypeptides by 
methods known in the art. For example, one may use chemical synthesis, such as the 
solid phase procedure described by Houghton et al. ( 1 985), Proc. Natl . Acad . Sci . 82: 
5131-5. Another method is in vitro translation of mRNA. One may also produce the 
5 polypeptides in the above-described host cells, which is the preferred method. For 
example, one may synthesize DNA comprising all or a portion of SEQ ID NO:l, SEQ 
ED NO:3, SEQ ID NO:5. SEQ ID NO:7, SEQ ID NO:9, or SEQ ID NO:l 1 by PGR as 
described above, insert the synthesized DNA into an expression vector, transform a 
host cell with the expression vector and culture the host cell to produce the desired 
10 polypeptides. 

Persons skilled in the art can isolate and purify such polypeptides by any one 
of several known techniques: for example, ion exchange chromatography, gel 
filtration chromatography and affinity chromatography. Such techniques may require 
modification of the protein. For example, one may add a histidine tag to the protein to 

15. enable purification on a nickel column. 

Persons skilled in the art can use the polypeptides of the invention in a wide 
variety of ways. For example, one may use them to generate polyclonal or 
monoclonal antibodies. One may then use such antibodies for immunodetection (e.g., 
radioimmunoassay, enzyme immunoassay, or immunocytochemistry), 

20 immunopurification (e.g., affinity chromatography) of polypeptides from various 
sources, or immunotherapy. 

Persons skilled in the an may make modified OATP polypeptides by known 
techniques. Such modifications may cause higher or lower activity, permit higher 
levels of protein production, or simplify purification of the protein. Such 

25 modifications may help identify specific OATP amino acids involved in binding, 
which in turn may help rational drug design of OATP modulators. One can make 
amino acid substitutions based on similarity in polarity, charge, solubility, 
hydrophobicity, hydrophiliciry and/or the amphipathic nature of the residues involved. 
For example, negatively charged amino acids include aspanic acid and glutamic acid; 

30 positively charged amino acids include lysine and arginine; amino acids with 

uncharged polar head groups or nonpolar head groups having similar hydrophiiicity 
values include the following: leucine, isoieucine, valine, glycine, alanine; asparagine, 
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gluiamine; serine, threonine; phenylalanine, tyrosine. All such modified polypeptides 
are included within the scope of the invention. 

Preferred analogs include proteins that differ from the novel OATPs of the 
present invention (or biologically active fragments thereof) by one or more 

5 conservative amino acid substitutions or by one or more non-conservative amino acid 
substitutions, deletions or insertions which do not abolish the biological activity of the 
analog. Conservati ve substitutions typically include the substitution of one amino 
acid for another with similar characteristics, e.g., substitutions within the following 
groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, 

10 glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and 

phenylalanine, tyrosine. Other conservative amino acid substitutions can be taken 
from the table below. 

Table 1 



Conservative amino acid replacements 

15 



For Amino Acid 


Code 


Replace with any of: 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, He, D- 
Met, D-Ile, Orn, D-Om 


Asparagine 


N 


D-Asn, Asp, D-Asp, Glu, D-Glu, Gin, D-Gln 


Aspartic Acid 


D 


D-Asp, D-Asn, Asn, Glu, D-Glu, Gin, D-Gln 


Cysteine 


C 


D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr 


Glutamine 


Q 


D-Gln. Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


Glutamic Acid 


E 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin. D-Gln 


Glycine 


G 


Ala, D-Ala, Pro, D-Pro, B-Ala, Acp 


Isoleucine 


I 


D-Ile, Val, D-VaL Leu, D-Leu. Met. D-Met 


Leucine 


L 


D-Leu, Val, D-Val, Met. D-Met 


Lysine 


K 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D- 
Met, He, D-Ile, Orn, D-Om 


Methionine 


M 


D-Met, S-Me-Cys, lie, D-lle, Leu, D-Leu, Val, D-Val 


Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Tip, 
Trans-3,4, or 5-phenylproline. cis-3.4, or 5-phenylproline 


Proline 


P 


D-Pro, L-l-thioazolidine-4-carboxylic acid, D- or L-l- 
oxazolidine-4-carboxylic acid 


Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met. D-Met, Met(O), D- 
Met(O), L-Cys, D-Cys 


Threonine 


T 


D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), D- 
Met(O), Val, D-Val 


Tyrosine 


Y 


D-Tyr, Phe, D-Phe. L-Dopa, His, D-His 


Valine 


V 


D-Val. Leu, D-Leu, He. D-Ile, Met, D-Met 
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Other analogs within the invention are those with modifications which 
increase protein or peptide stability; such analogs may contain, for example, one or 
more non-peptide bonds (which replace the peptide bonds) in the protein or peptide 
sequence. Also included are analogs that include residues other than naturally 
5 occurring L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic 
amino acids, e.g., B or y amino acids. 

The inventors contemplate a number of other variations of the above-described 
polypeptides. Such variations include salts and esters of the polypeptides, as well as 
precursors of the aforementioned polypeptides (e.g., having N-terminal substituents 
10 such as methionine, N-formyimethionine and leader sequences). The invention 
includes all such variations. 

Method for detecting nucleic acids 

The present invention further concerns a method for detecting nucleic acids 
encoding OATP proteins. In this method, a person of ordinary skill in the art (a) 

15 contacts nucleic acids of unknown sequence with a nucleic acid having a sequence 
complementary to a known coding sequence (e.g., a sequence of at least about 10 
nucleotides from, e.g., SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, 
SEQ ED NO:9, or SEQ ID NO:l 1, particularly the coding regions thereof), wherein 
the latter nucleic acid has a detectable marker; and (b) determines the presence of 

20 marker bound to any of the nucleic acids of unknown sequence. The presence of 

bound marker indicates the presence of the desired nucleic acids. One can apply this 
method to detect OATP nucleic acids from other tissues (which may have different 
regulatory elements) and nucleic acids from other species (e.g., monkey). 

Persons of ordinary skill in the art generally know how to obtain nucleic acids 

25 to be analyzed in this method. For genomic DNA, one can rapidly freeze tissue, crush 
the tissue into readily digestible pieces, and incubate the crushed tissue in proteinase 
K and SDS to degrade most cellular proteins. One can then deproteinize the genomic 
DNA by successive phenol/chlorofomvisoamyl alcohol extractions, recover DNA by 
ethanoi precipitation, dry it and resuspend it in buffer. For RNA. one can lyse 

30 cultured cells in 4M guanidinium solution, draw the lysate through a 20-gauge needle, 
pellet the RNA through a cesium chloride step gradient, and remove the supernatant. 
The'pellet should contain purified RNA. 
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The detectable marker may be a radioactive ion linked to one of the 
nucleotides of the complementary nucleic acid. Common radioactive labels are 32p 
and ^S. although one may also use other labels such as biotin. Persons skilled in the 
art are aware of various methods to attach the labels to the complementary nucleic 
5 acid (e.g., the random primer method for attachment of 32 P or 35 S). 

Persons of ordinary skill in the an generally know how to carry out such a 
method of detecting nucleic acids. For example, one may perform a Southern or 
northern blot using a radiolabeled OATP complementary oligonucleotide probe. One 
can then detect hybridization by autoradiography. Depending on the marker, one may 
10 also use other detection methods (e.g., spectrophotometry). 

Methods for detecting OATP modulators and compounds transported by the 

OATPs of the present invention 

This invention further concerns methods for detecting modulators of the 
OATPs of the present invention, as well as methods for detecting compounds that are 

1 5 transported by the OATPs of the present invention (e.g., compounds that are 

transported into the liver that may be used as carriers for other compounds). A screen 
for OATP modulators entails detecting binding of molecules (e.g., polypeptides, 
natural products, synthetic compounds) in cells expressing OATP protein. 
Alternatively, a screen for OATP positive modulators and/or negative modulators 

20 entails detecting the augmentation and/or inhibition of transport of a known 

compound. A screen for OATP-transported compounds emails detecting the transport 
of molecules (e.g., polypeptides, natural products, synthetic compounds) by an OATP. 

Gloning and sequencing of the OATPs of the present invention enables 
construction of cells useful in screening for natural products and synthetic compounds 

25 that bind to, modulate, and/or are transported by OATP activity, A process for 

detecting OATP modulators requires transforming a suitable vector into compatible 
host cells as described previously herein. One treats such transformed cells with test 
substances (e.g., synthetic compounds or natural products), and then measures activity 
in the presence and absence of the test substance. 

30 OATP Assay 

An assay for the measurement of OATP activity is performed as follows: 
HEK293 cells are plated in Dulbeccos Modified Eagles Medium (DMEM) plus 10% 
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fetal bovine serum plus peneciilin and streptomycin, in poly-d-Iysine coated dishes 
and co-transfected with OATP transporter expression plasmids using Lipofectamine 
Plus (Life Technologies, inc.). The cells and media are assayed for substrate transport 
24 hours later. Alternatively, cell lines engineered to stably express OATPs could be 

5 plated and assayed directly without transfection. To measure transport, media is 
removed and monolayers are assayed in triplicate by washing once in serum-free 
DMEM and adding the same medium containing [ 3 H]-substrate alone or in the 
presence of various concentrations of unlabeled test compounds. For OATP2, the 
[ 3 H]-substrate could be [ 3 H]-pravastatin, [ 3 H]-taurocholate, or [ 3 H]- 

10 dehydroepiandrosterone sulfate, or [ l25 I]-thyroid honnone (T4). Monolayers are 
incubated at room temperature for 5 to 10 minutes depending on the transporter. 
Then the cells are rapidly washed once with ice cold DMEM containing 5% BSA, 
twice with DMEM plus 0.1% BSA and once with DMEM alone. Cells are lysed in 
0.1 N NaOH and a fraction of the lysate is used to determine radiolabel incorporation 

15 by liquid scintillation counting, and another is used to determine protein concentration 
in the lysate using the Bradford assay with BSA as a standard. The transport activity 
is expressed as moles of substrate transported into cells/mg of cell protein/minute. 
Drue Targeting 

Also included within the present invention is tissue expression of an OATP of 
20 the present invention. The OATPs of the present invention are also useful for 
targeting drugs to certain organs that express an OATP described herein (e.g., the 
liver), and for modulating the concentration of endogenous substrates. 

For example, the novel organic anion transporter disclosed herein, OATP2, 
represents a potential therapeutic target due to its ability to modulate the cellular 
25 uptake and potential secretion of a several biologically important organic anions, 
including bile acids and the androgen honnone dehydroepiandrosterone sulfate 
("DHEAS"). Furthermore, since OATP2 transports at least one drug (i.e. pravastatin), 
and other members of this family are known to transport a variety of other 
xenobiotics. this transporter could be exploited to optimize the delivery of drugs into 
30 liver and away from other tissues. 

OATP2 is unique among the OATP family, in that it is the only known 
organic anion transporter that is expressed exclusively in the liver. Thus, drugs 
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optimized for this transporter could be targeted for hepatic delivery with greater 
selectivity than with any other known transporter. To generalize this approach, it may 
be possible to identify a small molecule "adaptor" that is efficiently recognized and 
transported by OATP2 (an OATP2- transported compound) that could be appended to 
other drugs for hepatic targeting even if the parent compound is not transported by 
OATP2. 

Alternatively, if a therapeutic compound is taken up into the liver entirely or 
substantially by OATP2, one could inhibit hepatic clearance and thereby elevate 
circulating concentrations, or increase the compounds half-life in the periphery, by 
adding a functionality to said compound that disallows transport by OATP2. 
Likewise, if an endogenous substance utilizes OATP2 for liver uptake and clearance 
from the circulation, a competitive or non-competitive OATP2 inhibitor could elevate 
plasma levels of said substance. As an example, DHEAS is an adrenal androgen that 
declines with age and on the basis of some animal data, it has been suggested that 
replacement of DHEAS deficiency may stimulate age-related immune deficiencies, 
increase cognitive function and insulin sensitivity, and maintain bone mass. 
Inhibiting the hepatic clearance of endogenous DHEAS through blocking its 
interactions with OATP2 could result in elevated hormone levels in the absence of 
hormone supplementation. 

i 

With the information provided herein, one skilled in the an is able to identify 
molecules, both naturally occurring and synthetic (including therapeutic drugs), that 
are transported by the OATPs, e.g., OATP2, disclosed herein. OATPs as a class 
generally exhibit broad substrate specificity Cpolyspecific" transporters). Thus, it is 
anticipated that many additional substrates of these transporters will be identified. 

Gene Therapy 

Persons skilled in the an can also use sense and antisense nucleic acid 
molecules as therapeutic agents for OATP-related indications. One may construct 
vectors that direct the synthesis of the desired DNA or RNA or formulate the nucleic 
acid as described in the art. 

Several references describe the usefulness of antisense molecule. See Toulme 
and Helene (1988), Gene 72: 51-8; Inouye (1988), Gene, 72: 25-34; Uhlmann and . 
Peyman (1990), Chemical Reviews 90: 543-584; Biotechnology Newswatch (January 
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15. 1996), p. 4; Robertson. Nature Biotechnology 15: 209 (1997); Gibbons and Dzau 
(1996), Science 272: 689-93. One can design them based on genomic DNA and/or 
cDNA. 5' and 3' flanking control regions, other flanking sequences, intron sequences, 
and nonclassic Watson and Crick base pairing sequences used in formation of triplex 
5 DNA. Such antisense molecules include antisense oligodeoxyribonucleotides, 
oligoribonucleotides. oiigonucleotide analogues, and the like, and may comprise at 
least about 15 to 25 bases. 

Antisense molecules may bind noncovalently or covalentiy to the OATP DNA 
or RNA. Such binding could, for example, cleave or facilitate cleavage of OATP 

10 DNA or RNA. increase degradation of nuclear or cytoplasmic mRNA, or inhibit 

transcription, translation, binding of transactivating factors, or pre-mRNA splicing or 
processing. Antisense molecules may also contain additional functionalities that 
increase stability, transport into and out of cells, binding affinity, cleavage of the 
target molecule, and the like. All of these effects would decrease expression of OATP 

15 protein and thus make the antisense molecules useful as OATP modulators. 



The following examples are included for understanding the present invention 
and are not intended to limit the scope of Applicants invention, which is defined 
20 solely by the claims. 



Isolation of OATP2. OATP-RPL OATP-RP2, OATP-RP3. OATP-RP4 and OATP- 
RP5 full length cDNAs and cloning into mammalian expression vectors 
Human OATP2 was identified by searching the public EST databases for 
25 sequences homologous to human OATP. One EST sequence, Genbank accession 
number T73863, encoded a partial cDNA with significant sequence identity with 
OATP. EST sequences encoding partial cDNAs for OATP-RP1, OATP-RP2, OATP- 
RP3, OATP-RP4, and OATP-RP5 were identified by searching the public EST 
databases and the Incyte. Inc. EST database for sequences homologous to human 
30 OATP. The EST clone IDs corresponding to OATP-RP1 are 8201 17, 2668489, 

1610706, 2972518, and 588148. These clones represent a contig encoding only part of 
the full length cDNA. The Incyte EST clone IDs corresponding to OATP-RP2 are 



EXAMPLES 



Example 1 
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1664737 and 2641944. These clones represent a contig encoding only pan of the full 
length cDNA. The Incyte EST clone IDs corresponding to OATP-RP3 are 2493241, 
2497845, and 2664024. These clones represent a contig encoding only part of the full 
length cDNA. The Incyte EST clone IDs corresponding to OATP-RP4 are 1494683 

5 and 1685219. These clones represent a contig encoding only part of the full length . 
cDNA. The Incyte EST clone ID corresponding to OATP-RP5 is 925716. This clone 
encodes only part of the full length cDNA. Full length clones for each of the above 
genes were obtained using the Gene Trapper cDNA Positive Selection System 
(LifeTechnologies, Inc.). In this procedure, a single or multiple oligonucleotides 

10 complementary to each of the EST contigs or individual EST sequences, were 
biotinyiated at the 3'-end and used to hybridize to a single-stranded human cDNA 
library constructed in pCMVSport2 (LifeTechnologies, Inc.). The sequence of 
oligonucleotides used for each gene as well as the tissue source of the libraries 
screened are shown in Table 2. 

15 Table 2 

Oligonucleotides used to screen for OATP Full length cDNAs using Gene-Trapper 

Selection 







Seq ID number 


Human cDNA 






of 


library 


Gene 


Biotinyiated capture oligonucleotides) used 


oligonucleotide 


screened 


OATP2 


5 '-ACCCTGTCTAGCAGGTTGCA-3 * , 


13 


liver 


OATP-RP1 


5'*CTGTCGGAGTCTTCAGATG-3' 


14 


brain 


OATP-RP2 


5'-TCCATCACAGCCTCCTACGC-3* 


15 


liver 


OATP-RP3 


5'-TGCCTCTACTCTGACCCTAGr3' 


16 


heart 


OATP-RP4 


5'-GGAGCAGTCATTGACACCAC-3' 
S'-TGCTGGGAGTACAACGTGACGO' 
5'-ACAAGGAGGATGGACTGCAG-3' 


17. 
18 
19 


heart 


OATP-RP5 


5'-CAGGAATCCCAGCTCCAGTG-3' 
5 , -GCTACAACCCAACTACTGGC-3 , 
S'-GGGACTAACTGTGATACTGG-S' 


20 
21 
22 


brain 



Hybrids between the biotinyiated oligonucleotides and single-stranded cDNA were 
20 captured on streptavi din-coated paramagnetic beads. After washing, the captured 

single-stranded cDNA targets was released from the biotinyiated oligonucleotides and 
converted to dsDNA by DNA polymerase using the corresponding unbiotinylated 
oligonucleotide. Following transformation and plating, several positive clones for 
each gene were identified by PCR analysis. Full-length cDNA clones were identified 



47 

SUBSTITUTE SHEET (RULE 26) 





WO 00/71566 



PCT/US00/13939 



by sequencing. In the case of OATP-RP1, a partial cDNA was obtained by the above 
technique (pSP-RPl A). Another cDNA clone that was part of the OATP-RP1 contig 
was identified by searching the public EST databases (Genbank accession number 
AI027850). An EcoRI-NotI fragment of this clone containing the first 477 nucleotides 
5 of OATP-RP1 (SEQ ID NO: 1 1) (obtained from Research Genetics, Inc.) was ligated 
to EcoRI-Not I digested pSP-RPlA to generate the full length sequence. 

Two polymorphic positions were identified when sequencing multiple OATP- 
RP4 cDNA clones. Thus, nucleotide number 713 of SEQ ID NO: 7 can be either a C, 
encoding Leu in SEQ ID NO:8, or a T, encoding a Phe in SEQ ID NO:8. Similarly, 
10 nucleotide number 2397 of SEQ ID NO: 7 can be either a G, encoding a Gly in SEQ 
ID NO:8 , or a T, encoding a Val in SEQ ID NO:8. 

For expression studies, OATP2 cDNA was cloned into the expression vector 
pCEP4pR, a modified form of pCEP4 (lnvitrogen, Inc.) in which the CMV promoter- 
driven expression cassette has been inverted, and used in transient transfections. To 
15 accomplish this, OATP2 cDNA in pCMVSport2, correponding to nucleotides 59 
through 2361 of SEQ ID NO:l, was excised by digestion with Kpnl and Notl. This 
fragment was cloned into KpnI-NotI digested pCEP4(}R. This clone, pCEP-OATP2 
was used for transient transfection expression studies. 

20 Example 2 

Tissue and cellular distribution of OATP2, OATP-RPl,'OATP-RP2, OATP-RP4, and 



The tissue distribution of OATP2, OATP-RP1, OATP-RP2, OATP-RP4, and 
OATP-RP5 expression was determined by Northern blotting of poly A+ RNA from a 

25 variety of human tissues (Figure 1). Transporters of this family previously described 
in the literature, namely human OATP, rat oatpl, rat oatp2 and rat oatp3, are all 
expressed in liver, kidney and brain. All of the above transport bile acids as well as a 
variety of other substrates that are specific for subsets of these transporters. In 
contrast, the expression of OATP2, which also transports bile acids T is very hepato- 

30 specific; a major 3.2 kb and several minor hybridizing bands were observed only in 
RNA from liver and no other tissue. The specific cell types that express this 
transporter were examined by in situ hybridization of OATP2 riboprobe to human 
liver samples. Strong hybridization signal was seen localized to hepatocytes 



OATP-RP5 
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throughout the liver iobule with no significant difference in signal intensity among 
centrilobular, midzonal or periportal regions. No signal was observed in bile ducts, 
Kupffer cells, or blood vessels, nor in any cell types from human lung (data not 
shown). 

5 OATP-RP1 is expressed in nearly all tissues tested with highest abundance in 

skeletal muscle, lung, placenta, and heart. OATP-RP2 is ubiquitously expressed in all 
tissues tested. OATP-RP4 has a much more restricted partem of expression with 
abundant transcipts in skeletal muscle and heart and much less in prostate and thymus. 
The expression of OATP-RP5 is likewise tissue specific, with brain and testes being 
10 the only sites where transcripts were detected. 

Example 3 
Expression of OATP2 in transfected cells 
293EBNA cells (Invitrogen, Inc.), an HEK293 cell derivative, were transiently 
1 5 transfected with the OATP2 expression vector pCEP-OATP2, or the pCEP4 vector 
alone (MOCK) and the transport of [ 3 H]-labeled substrates was determined 24 hours 
later. Figure 2A shows specific uptake of [ 3 H]-pravastatin and [ 3 H]-DHEAS. Figures 
2B and 2C show the specific uptake of [ 3 H]-taurocholate and [125I]-thyroid hormone 
(T4), repsectively. The uptake of radiolabeled substrate for 5 minutes into cells 
20 transfected with pCEP-OATP2 or empty vector (MOCK) was determined in the 
absence (solid bars) and presence (open bars) of excess unlabeled substrate. Thus, 
OATP2 is a liver specific human transporter of at least some HMG CoA reductase 
inhibitors, bile acids, adrenal steroids, and thyroid hormone. 
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We claim: 



A purified and isolated nucleic acid sequence encoding all or a portion 



10 



15 



20 



25 



of an organic anion transport protein ( 44 OATP")> said OATP comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:2 (OATP2), SEQ ID 
NO:4 (OATP-RP2), SEQ ID NO:6 (OATP-RP3), SEQ ID NO:8 (OATP-RP4), SEQ 
ID NO:10 (OATP-RP5), and SEQ ID NO:12 (OATP-RP1). 

2. The nucleic acid sequence of claim 1 comprising (a) a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 ? SEQ ID NO:9, and SEQ ID NO:l 1; (b) the coding region of (a); 
(c) the complement of (a) or (b); or (d) nucleic acid sequences that differ from (a), (b) 
or (c) due to degeneracy of the genetic code. 

3. An expression vector comprising a nucleic acid molecule as claimed in 
claim 1 or 2 and an expression control sequence operatively linked to the nucleic acid 
molecule. 

4. A transformant host cell including an expression vector comprising a 
nucleic acid molecule as claimed in claim 1 or 2 and an expression control sequence 
operatively linked to the nucleic acid molecule. 

5. An OATP protein comprising the amino acid sequence selected from 
the group consisting of SEQ ID NO:2 (OATP2), SEQ ED NO:4 (OATP-RP2), SEQ ID 
NO:6 (OATP-RP3), SEQ ID NO:8 (OATP-RP4), SEQ ID NO:10 (OATP-RP5), and 
SEQ ID NO:12 (OATP-RP1). 

6. A modified OATP protein comprising an OATP of claim 5 that 
maintains an activity of said OATP protein of claim 5, wherein said modified OATP 
protein comprises at least one amino acid substitution or deletion. 

7. A method of producing OATP, said method comprising the steps of: 
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a) inserting a nucleic acid sequence according to claim 1 or 2 
encoding said OATP protein, or a homologue thereof, into an appropriate expression 
vector, 

b) transfecting said expression vector into an appropriate 
5 transfection host cell, 

c) growing said transfected host cells in an appropriate culture 

media, and 

d) purifying the OATP protein, or a homologue thereof, from said 
culture media. 

10 

8. An isolated nucleic acid sequence which hybridizes under stringent 
conditions to the nucleic acid sequence of claim 1 or 2, wherein said nucleic acid 
sequence contains at least 1 8 contiguous nucleotides from SEQ ID NO: 1, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9 or SEQ ID NO:l 1. 

15 

9. An antibody specific for the OATP as claimed in claim 5. 

10. The antibody of claim 9 wherein said antibody is a monoclonal 
antibody. 

20 • 

1 1 . The OATP of claim 5, produced by: 

a) inserting a nucleic acid sequence encoding said OATP into an 
appropriate expression vector, 

b) transfecting said expression vector into an appropriate 
25 transfection host cell, 

c) growing said transfected host cells in an appropriate culture 

media, and 

d) . purifying the OATP from said culture media. 

30 12. A method for identifying a ligand which is capable of binding to the 

OATP of claim 5, or to a part of said OATP. said method comprising the steps of: 
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(a) reacting said O ATP. or part of said OATP, with said ligand which 
potentially is capable of binding to the OATP or part of said OATP, under conditions 
which permit the formation of ligand-OATP complexes; and 

(b) assaying for ligand-OATP complexes, for free ligand, or for non- 
5 complexed OATP. 

13. A method for identifying a substrate which is capable of being 
transported by the OATP of claim 5, or a part of said OATP, said method comprising 
the steps of: 

10 (a) reacting said OATP, or part of said OATP, with said substrate which is 

potentially capable of being transported by the said OATP or part of said 
OATP, under conditions which permit the movement of said substrate 
across a membrane; and 
(b) assaying for the movement of said substrate across the membrane. 

15 

1 4. A method of deli vering a molecule to a an organ that expresses an 
OATP protein of claim 5, said method comprising: 

(a) identifying a substrate that is transported by said OATP; 

(b) joining said substrate to said molecule to be delivered to form a substrate- 
20 molecule fusion compound; and 

(c) providing said substrate-molecule fusion compound to said organ. 

15. A fusion protein comprising all or a portion of the OATP of claim 5, 
attached to a second polypeptide. 

25 

16. A method for identifying a modulator which is capable of augmenting 
or inhibiting the transport of a substrate by the OATP of claim 5, or a part of said 
OATP, said method comprising: 

a) reacting said OATP, or part of said OATP, with said substrate and said 
30 modulator which potentially is capable of augmenting or inhibiting the 

transport of a substrate under conditions which permit the movement 
of said substrate across a membrane; 
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b) 



measuring the augmentation or inhibition of transport of said 
compound by said modulator. 



17. 



A nucleic acid molecule of claim 2, wherein said 



nucleic acid molecule 



10 



15 



20 



comprises the OATP gene, or a complement of the OATP gene, contained in ATCC 
Accession Number 207209. 

1 8. A nucleic acid molecule of claim 2, wherein said nucleic acid molecule 
comprises the OATP gene, or a complement of the OATP gene, contained in ATCC 
Accession Number 207210. 

19. A nucleic acid molecule of claim 2, wherein said nucleic acid molecule 
comprises the OATP gene, or a complement of the OATP gene, contained in ATCC 
Accession Number 2072 11. 

20. A nucleic acid molecule of claim 2, wherein said nucleic acid molecule 
comprises the OATP gene, or a complement of the OATP gene, contained in ATCC 
Accession Number 207212. 

21. A nucleic acid molecule of claim 2, wherein said nucleic acid molecule 
comprises the OATP gene, or a complement of the OATP gene, contained in ATCC 
Accession Number 207213. 

22. A nucleic acid molecule of claim 2, wherein said nucleic acid molecule 
comprises the OATP gene, or a complement of the OATP gene, contained in ATCC 
Accession Number 2072 1 4. 
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Tissue Key 

H: heart S: spleen 

B: brain Ty: thymus 

P: placenta Pr: prostate 

L: lung T: testis 

Lv: liver O: ovary 

Sm: skeletal Si: small intestine 

muscle C: colon 

K: kidney Bl: peripheral blood leukocytes 

Pn: pancreas 
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